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Abstract 

We reconsider the conceptual foundations of the renormalization-group (RG) 
formalism, and prove some rigorous theorems on the regularity properties and 
possible pathologies of the RG map. Regarding regularity, we show that the 
RG map, defined on a suitable space of interactions (= formal Hamiltonians), is 
always single- valued and Lipschitz continuous on its domain of definition. This 
rules out a recently proposed scenario for the RG description of first-order phase 
transitions. On the pathological side, we make rigorous some arguments of Grif- 
fiths, Pearce and Israel, and prove in several cases that the renormalized measure 
is not a Gibbs measure for any reasonable interaction. This means that the RG 
map is ill-defined, and that the conventional RG description of first-order phase 
transitions is not universally valid. For decimation or Kadanoff transformations 
applied to the Ising model in dimension d > 3, these pathologies occur in a 
full neighborhood {/3 > /3o, \h\ < e(/3)} of the low-temperature part of the first- 
order phase-transition surface. For block-averaging transformations applied to 
the Ising model in dimension d > 2, the pathologies occur at low temperatures 
for arbitrary magnetic-field strength. Pathologies may also occur in the critical 
region for Ising models in dimension d > 4. We discuss in detail the distinc- 
tion between Gibbsian and non-Gibbsian measures and the possible occurrence 
of the latter in other situations, and give a rather complete catalogue of the 
known examples. Finally, we discuss the heuristic and numerical evidence on 
RG pathologies in the light of our rigorous theorems. 
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1 Introduction and Summary of Results 
1.1 General Introduction 

A principal tenet of the renormalization-group (RG) theory of phase transitions [365, 
277, 254, 118] is that the RG map, defined on a suitable space of Hamiltonians, is 
smooth (i.e. analytic or at least several-times differentiable), even on phase-transition 
surfaces. The singularities associated with critical points [365, 277, 254, 118] and first- 
order phase transitions [280, 217, 119] are then explained in terms of the behavior of 
the RG map under infinite iteration. 

This picture of a smooth RG map has, however, been questioned, particularly as 
regards the behavior at or near a first-order phase transition. On the one hand, the 
existence of several phases raises the possibility that the RG map may be discontinuous 
or multi-valued [173, 197, 305] on the first-order transition surface, as the numerical 
evidence reported by several groups [33, 233, 72, 163] seems to suggest. On the other 
hand, Griffiths and Pearce [172, 173, 171] have pointed out some "peculiarities" of the 
commonly used discrete-spin position-space RG transformations (decimation, majority 
rule, etc.); in particular they suggested that the RG map for the two-dimensional Ising 
model must have singularities (or other strange behavior) in a rather large part of 
(/3, /i)-plane (see also [54, 334]).^ In an important but apparently little-known paper, 
Israel [207] clarified the nature of the Griffiths-Pearce peculiarities: he showed that 
in at least one case the renormalized system cannot be described by a Boltzmann- 
Gibbs prescription for any reasonable Hamiltonian, i.e. the renormalized measure is 
non-Gibbsian. 

In this paper^ we reconsider the conceptual foundations of the RG formalism, and 
prove some rigorous theorems on the nature of the RG map. On the one hand, we 
prove two Fundamental Theorems on the single-valuedness and continuity of the RG 
map; these theorems rule out the discontinuous-fiow scenario proposed in references 
[33, 233, 72, 163, 197, 305]. On the other hand, we prove, completing and extending 
Israel's argument, that in several cases the RG map is ill-defined for a much more basic 
reason: the renormalized Hamiltonian may fail to exist altogether. This implies that 
the conventional RG description of first-order phase transitions [280, 217, 119] is not 
valid either (at least in these models and for these RG transformations). Moreover, this 
pathology can occur in the vicinity of — not only at — a first-order phase transition: for 
the Ising model in dimension J > 3 it occurs in a full neighborhood {/3 > /3o, |^| < e(/3)} 
of the low-temperature part of the first-order phase-transition surface. Indeed, for 
certain block-averaging transformations we are able to show that the pathology occurs 
at low temperature and all magnetic fields h. 

Our point of view is the following: An RG map is defined initially as a rule (which 



^Similar peculiarities, and also different ones, have been found by Hasenfratz and Hasenfratz [189]. 
The phenomena found in Section 4 of their paper are very closely related to those of Griffiths and 
Pearce, while those in Sections 2 and 3 seem to be quite different. 

^ Brief summaries of our results have appeared previously [352, 353, 354, 355]. 



may be either deterministic or stochastic) for generating a configuration uj' of "block 
spins" given a configuration cu of "original spins". Mathematically this is given by 
a probability kernel T{uj — )• cu'). Using such a map, one can immediately define a 
probability distribution fi'{io') of block spins from any given probability distribution 
fi{io) of original spins, namely 



fi'{uj') 



{fiT){uj') = ^/i(cu)r(cu 



id 



:i.i^ 



In other words, the RG map is easily defined as a map from measures to measures. On 
the other hand, most applications of the renormalization group assume (and in fact 
need) that the RG map is defined as a map from Hamiltonians to Hamiltonians . That 
is, if /i is the Gibbs measure for a statistical-mechanical system with Hamiltonian 
i7, then one usually assumes that //' is the Gibbs measure for a system with some 
Hamiltonian H'] this is taken to define an RG map TZ on some suitable space of 
Hamiltonians, by the diagram 



T i 

n 



fiT 



:i.2) 



H 



H' 



Formally the relation between a Hamiltonian and its corresponding Gibbs measure is 



given by /i = const X e' 
defined formally by 



-H 



and hence the RG map on the space of Hamiltonians is 



H'[ 



u 



{nH){ 



u 



log 



Z^' 



-H(uj) rp. 



U 



u 



+ const 



:i.3) 



However, this formula is valid only in finite volume] in infinite volume, the Hamiltonian 
H[(jj) is ill-defined (its value is almost surely ±00), and the connection between a formal 
Hamiltonian (more precisely, an interaction) and its corresponding Gibbs measure(s) 
is much more complicated. We emphasize that this is not a mere mathematical nicety, 
but contains the fundamental physics of phase transitions. In finite volume, where 
the formula // = const X e~ makes sense, all thermodynamic functions are manifestly 
analytic functions of the parameters in the Hamiltonian, so a phase transition is im- 
possible. Phase transitions occur only for infinite- volume systems. Now, one feature of 
the infinite- volume limit is the possibility that the Gibbs measure may be non-unique: 
corresponding to a given formal Hamiltonian (= interaction) there may exist several 
distinct Gibbs measures, each one corresponding to a distinct thermodynamically sta- 
ble "pure phase" of the system. Indeed, such a multiple-phase coexistence can serve 
as one definition of a first-order phase transition. Therefore, for Hamiltonians H with 
a non-unique Gibbs measure (= Hamiltonians lying on a first-order phase-transition 
surface), the upward vertical arrow in (1.2) may well be a multi-valued map; and one 
might fear that this could cause the putative RG map TZ to become multi-valued as 
well. (We shall see later, however, that this pathology cannot occur.) Even more sub- 
tle problems arise from the downward vertical arrow in (1.2): though at most one H' 



8 



can correspond to a given //' [174], it can happen that no H' corresponds to the given 
/i' — that is, it can occur that the image measure fi' is not a Gibbs measure for any 
reasonable Hamiltonian. In Section 3 we shall show that such non-Gibbsianness is the 
only way that the RG map can become grossly "pathological". In Section 4 we shall 
show that this pathology does in fact occur in a rather wide variety of examples. 

(Of course, we must make precise what we mean by a "reasonable" Hamiltonian, and 
convince the reader that our class is sufficiently wide to capture fully the intuitive notion 
of "physical reasonableness". This will be discussed in detail in Sections 2 (especially 
2.3.3, 2.4.4 and 2.6.7) and 6.1.2. Suffice it to say now that we allow interactions 
of arbitrarily long range and involving arbitrarily many spins, subject only to the 
condition of absolute summability.) 

These results leave RG theory in roughly the following situation: The RG map 
has been proven to be well-defined and analytic at high temperature [207, 212] and, 
in some cases, at large magnetic field [173] — regions in which phase transitions are 
absent, and RG theory is unnecessary. The RG map has been proven in some cases 
to be ill-defined at low temperature (Section 4). Near the critical point — where RG 
theory is of the most interest — very little is known about the behavior of the RG map, 
but there are some indications of possible pathologies in dimensions d ^) 4 (Sections 
4.4 and 5.2). Nevertheless, RG ideas have been of great value even in situations in 
which the strict Wilson prescription (1.2) has not been — and maybe even cannot be 
— implemented [147, 148, 150, 149, 151, 181, 188, 53, 134, 135, 137, 4, 3, 112, 145, 
146, 44, 52, 183, 184, 185, 187, 186]. We discuss these issues further in Section 6.1. 

1.2 Plan of This Paper (Or, What to Read and What to 
Skip) 

We hope that this paper will be read (and readable) both by theoretical physicists — 
particularly those doing real-space RG and Monte Carlo RG calculations — and by 
mathematical physicists interested in the statistical mechanics of lattice systems. For 
this reason we have given in Section 2 a rather detailed (and, we hope, comprehensible) 
summary of the general theory of infinite- volume lattice systems, in which we make pre- 
cise the concepts of "interaction", "Hamiltonian", "Gibbs measure" and "equilibrium 
measure" and the connections between them. As we have argued, a careful treatment 
of the infinite- volume problem is essential for a correct physical understanding of phase 
transitions in general, and of the renormalization group in particular. We hope that 
Section 2 will be useful to physicists who may not be familiar with these ideas. Some 
abstract mathematics is of necessity involved; we have tried hard to minimize "mathe- 
matics for the sake of mathematics", and to introduce only those mathematical objects 
which correspond to clear physical concepts. The reader can judge whether we have 
succeeded."^ 



^The "experts" will notice a few innovations and new results in Section 2 and the associated 
Appendix A: the extensive discussion of physical equivalence (Sections 2.3.5, 2.4.3, 2.4.5, 2.4.6, A. 3. 4, 
A. 3. 5 and A. 3. 7); some precise estimates on bulk vs. surface effects (Sections 2.4.7, 2.4.8 and A. 3. 8); 



In Section 3 we define our general framework for studying renormalization transfor- 
mations, and prove the two Fundamental Theorems on single- valuedness and continuity 
of the RG map. These theorems show that the RG map TZ can never become multi- 
valued or discontinuous; but it can become non-valued, which occurs if the image 
measure //' is non-Gibbsian. This focus on non-Gibbsianness — which is the real mes- 
sage of our paper — is a profound insight due to Israel [207]. In Section 4 we complete 
and extend Israel's argument, and show that in a large class of examples (always at low 
temperature, but not only on phase-transition surfaces) the image measure //' is indeed 
non-Gibbsian. We also discuss some other operations that can lead to non-Gibbsian 
measures, including one which is relevant to "large-cell" majority-rule maps; and we 
give a rather complete catalogue of the known examples of non-Gibbsianness. In Sec- 
tion 5 we discuss the heuristic and numerical evidence on RG pathologies in the light 
of our rigorous theorems. We also discuss some heuristic arguments for the possible 
existence of RG pathologies in the critical region for Ising-to-Ising RG maps in dimen- 
sion J(^)4. In Section 6 we summarize our results and discuss their implications. We 
conclude with a list of open questions. 

In Appendix A we supply the proofs of some theorems that are stated without 
proof in Section 2. In Appendix B we provide a brief summary of Pirogov- Sinai theory, 
which is needed as a technical tool in Section 4."* In Appendix C we solve a Diophantine 
equation arising in our study of the majority-rule map. 

Let us again express our hope that the reader will at least peruse Section 2. (Hey, 
we spent a long time on it, and we think it is rather good pedagogy.) However, for 
the reader who is truly allergic to abstract mathematics, we offer the following advice: 
read the remainder of this Introduction, followed by Sections 3 (skipping the proofs), 
4.1.1, 4.4 (skipping the proofs), 5.2 and 6. Finally, for the reader who is allergic both to 
abstract mathematics and to 250-page papers, we offer "RG lite": read the remainder 
of this Introduction, and then skip to the Conclusion (Section 6.1). 

1.3 Summary of First and Second Fundamental Theorems 

We would like next to summarize the two Fundamental Theorems and give the physical 
intuition behind their proofs. Consider, for concreteness, the Ising model in dimension 
d > 2 at low temperature (/3 ^ /3c) and zero magnetic field. At such a point there 
are precisely two [141] pure phases (extremal translation-invariant Gibbs measures): 
the positively magnetized (or "-|-") phase /i_|_, and the negatively magnetized (or "— ") 
phase /i_. These pure phases can be obtained by taking the infinite- volume limit 



a consistent use of van Hove convergence and complete subadditivity (Sections 2.4.1, A. 3. 3, A. 3. 4 and 
A. 3. 5); and some interesting counterexamples concerning the pressure and entropy (Appendix A. 5. 2). 
The first two of these innovations play a crucial role in our proof of the Second Fundamental Theorem 
(Section 3.3). 

■*The "experts" will notice some small innovations in our presentation of Pirogov-Sinai theory, 
notably our emphasis on questions of uniformity . This plays an important role in our application to 
the Kadanoff transformation: see Section 4.3.3 and Appendix B.5.4. 
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using "+" or "— " boundary conditions, respectively. Both of these phases have a large 
magnetization ibAfo and a small correlation length ^. Now let us apply some block- 
spin transformation T, such as the majority-rule transformation on blocks of size 2'^. 
Then the image measures /i'_j_ = fi±T will have a yet larger magnetization zbAfg (since 
minorities tend to get "outvoted") and a yet smaller correlation length ^' (we expect 
roughly ^' sa (^/2, since distances are being scaled by a factor of 2). We now ask: These 
image measures /i'_j_ are typical of what kind of Hamiltonian (if any)? 

One possibility — and the one conventionally assumed [280, 217, 119] — is that the 
RG flow is toward lower temperatures along the h = line} This picture is certainly 
consistent with the intuitive idea that magnetization increases and correlation length 
decreases under the RG map. In this scenario [Figure 1(a)], the two image measures 
//'_)_ would be Gibbsian for the same Hamiltonian H\ and this Hamiltonian would be 
invariant under the cr — )• — cr symmetry. 

A different possibility was advocated by Decker, Hasenfratz and Hasenfratz [72]. 
In this scenario [Figure 1(b)], the RG flow is discontinuous at the phase-transition 
line /i = 0: Hamiltonians H with an infinitesimal positive (resp. negative) magnetic 
field h get mapped by a single RG step to renormalized Hamiltonians H' having a 
strictly positive (resp. strictly negative) magnetic field h' . Furthermore, at h = the 
renormalized Hamiltonian H' depends on which pure phase, /i_|_ or //_, one uses in the 
top left corner of (1.2): the image measure /i'_j_ would be Gibbsian for some Hamiltonian 
i7^ having (among other couplings) a strictly positive magnetic field, while the image 
measure //'_ would be Gibbsian for some Hamiltonian H'_ having a strictly negative 
magnetic field. (Obviously i7^ and H'_ would be related by the a — )• —a symmetry, 
i.e. by reversing the signs of all odd couplings.) In this scenario, therefore, the RG 
map TZ is discontinuous as one approaches the phase-transition line, and multi-valued 
on that line.^ This picture is also consistent with the intuitive idea that magnetization 
increases and correlation length decreases under the RG map. 

How can we distinguish between these two scenarios? Otherwise put: Suppose we 
are given a measure //' with a large positive magnetization and a small (but nonzero) 
correlation length. Does this measure come from a Hamiltonian H' with /3 large and 
/i = 0, or does it come from a Hamiltonian with /3 not so large (possibly even small) 
and h large and positive? Both of these regions in the (/3, h)-plane correspond to a 
large positive magnetization and a small correlation length. How can we distinguish 
between the two? 

The answer has to do with the large-deviation properties of the measure //'. Let 
A be a large cubical box of side Z, and let Ma = Y^xeA ^x be the total spin in A 



®More precisely, the flow would take place in an inflnite-dimensional space of couplings, but would 
respect the a -^ —a symmetry. That is, second-nearest-neighbor and longer-distance pair couplings, 
four-spin couplings, six-spin couplings and so forth would certainly be induced; but no magnetic flelds, 
three-spin couplings or other odd interactions would arise. 

®This possibility was suggested earlier, in the context of the 3-state Potts model in three dimensions, 
by Blote and Swendsen [33] and with especial clarity by Rebbi and Swendsen [305, p. 4099]. It was 
also suggested, in the context of a mean-fleld computation, by Hudak [197]. 
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h = +00 



h = 



h=-oo 




(a) 



h = +00 



h = 



h=-oo 




00 



(b) 



Figure 1: Two possible scenarios for the RG flow in the Ising model at low temperature. 
(a) RG map is continuous and single-valued on the phase-transition line, (b) RG map 
is discontinuous and multi-valued on the phase-transition line. 
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(a random variable). Clearly there is an overwhelming probability that Ma will be 
positive (and in fact very close to its mean value L M'q = L (cr)^/); but how rare is it 
to have Ma negative? If //' is a Gibbs measure for some Hamiltonian with /i > 0, then 
the event Ma < is suppressed by the bulk magnetic field: 

Prob^,(A^A < 0) ~ e-°(^') . (1.4) 

On the other hand, if //' is a Gibbs measure for some Hamiltonian with h = and 
/3 > /3c, then the event Ma < is suppressed only by a surface energy: 

Prob^,(A^A < 0) ~ e-°(^'"') . (1.5) 

It is now easy to decide between the two scenarios for the RG fiow. In the starting 
measure /i_|_, the occurrence of a large region with negative total spin is suppressed only 
like e~'^(^ >; roughly speaking, the measure /i_|_ "knows" that it is degenerate with 
the measure //_. But then in the block-spin measure /i'_j_ = /U+T, there must also be a 
probability > e~'^^^ > of observing a negative total spin (since a net negative original 
spin implies, with high probability, a net negative block spin). Since this contradicts 
(1.4), we conclude that /i'_j_ cannot be the Gibbs measure of a Hamiltonian with strictly 
positive magnetic field. Picturesquely, the image measure /i'_j_ "remembers" that it arose 
from an original Hamiltonian H with coexisting phases. Therefore, the discontinuous- 
fiow scenario is impossible; the RG map cannot be multi-valued or discontinuous. 

It is a relatively short step from these intuitive ideas to a rigorous proof. In Section 
3 we prove, in great generality, the following two theorems: 

First fundamental theorem. If // and u are Gibbs measures for the 
same interaction, then either fiT and pT are both non-Gibbsian, or else 
there exists an interaction for which both fiT and pT are Gibbs measures. 
In the latter case, this is the only interaction for which either fiT or pT is 
a Gibbs measure. Therefore, the renormalization-group map TZ cannot be 
multi-valued. 

Second fundamental theorem. The renormalization-group map TZ is 
continuous (in fact, Lipschitz continuous) on the domain where it is defined. 

Of course, these summaries of the theorems are not quite precise: we need to make 
clear, for example, in what space of interactions we are working, and in what norm we 
are defining continuity. The detailed statement of the Fundamental Theorems can be 
found in Sections 3.2 and 3.3, respectively. The proofs of the Fundamental Theorems 
are based on the general theory of infinite- volume lattice systems developed in Section 
2. These two theorems make clear that the only way in which the RG map can become 
grossly pathological is for it to be undefined^ i.e. for the image measure fi' to be non- 
Gibbsian. 
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1.4 Summary of GrifRths-Pearce-Israel Pathologies 

This is not, however, the end of the story: although the discontinuous-flow scenario for 
the RG map in the low-temperature Ising model is not correct, the traditional scenario 
is in many cases not correct either! The First Fundamental Theorem leaves open the 
possibility that the image measure //' = fiT may be non-Gihhsian, in which case the 
RG map TZ would be undefined . It turns out that this pathology does in fact occur in 
a rather wide variety of examples. The occurrence of non-Gibbsianness for the image 
measure //' was flrst pointed out by Israel [207] in one of the cases suggested by Griffiths 
and Pearce [173, 171]. In Section 4 we complete and extend Israel's argument, and 
show that in a large class of examples (always at low temperature, but not only on 
phase-transition surfaces) the image measure //' is non-Gibbsian. 

The non-Gibbsianness arises from the fact — already noted by Griffiths and Pearce 

— that the "internal spins" (the variables being integrated over in the RG transfor- 
mation) may undergo a first-order phase transition for some fixed block-spin configu- 
ration cUg j^j. Moreover, in some cases the different phases (= Gibbs measures) of the 
internal-spin system can be selected by an appropriate choice of block-spin boundary 
conditions. In this way, information can be transmitted from distant block spins to the 
block spin at the origin via the internal spins in the intermediate region, even when the 
block spins in the intermediate region are fixed. As a consequence, the renormalized 
measure //' violates a very weak locality condition — quasilocality ^ see Section 2.3.3 

— which is obeyed by every Gibbs measure coming from a reasonable interaction. It 
follows immediately that the renormalized measure //' must be non-Gibbsian. 

It is at first surprising that the existence of pathologies for a single block-spin 
configuration cu^ j^j — which has, of course, probability zero — can nevertheless cause 
the non-Gibbsianness of the renormalized measure; and indeed, this fact alone is not 
sufficient for concluding non-Gibbsianness. Rather, what happens in these examples is 
that for block-spin configurations which are near (in the product topology) to cu^ j^j 

— namely, those which agree with cu^ j^j in a large cube and differ from it outside — 
the internal-spin phase depends sensitively on the block spins outside the cube. These 
configurations have a small but nonzero probability, and this turns out to be sufficient 
for proving non-Gibbsianness. The details of the proof are given in Sections 4.1-4.3. 

We prove non-Gibbsianness at low temperature and zero magnetic field in the fol- 
lowing examples: 

• Decimation with any spacing 6 > 2, for the Ising model in any dimension d > 2. 

• The Kadanoff transformation with finite p and arbitrary block size 6 > I, for the 
Ising model in dimension d > 2J 

• The majority-rule transformation with 7x7 (or 41 X 41, 239 X 239, . . . ) blocks 
for the two-dimensional Ising model. 



^In earlier versions of this work [352, 353], we claimed this result only for small p. Subsequently 
we found a proof valid for all < p < co, which we present here. 
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• Averaging transformation with any even block size 6 > 2, for the Ising model in 
any dimension d > 2. 

Moreover, in several cases we can prove that these pathologies are present also at 
nonzero magnetic field. For the first two examples, we prove non-Gibbsianness in di- 
mension J > 3 in a full neighborhood {/3 > /3o, |^| < e(/3)} of the low-temperature part 
of the first-order phase-transition surface. In the last example, the pathologies can be 
proven in any dimension d > 2 and for arbitrary values of the magnetic field, again 
at low temperature. These latter results make clear that the Griffiths-Pearce-Israel 
pathologies are not associated with the fact that the original model is sitting on a 
phase-transition surface. Rather, it suffices that a first-order phase transition can be 
induced in the internal-spin system by choosing an appropriate block-spin configura- 
tion. For this we need to work at low temperature but not necessarily close to the 
phase-transition surface. 

Though we have not yet been able to demonstrate non-Gibbsianness for the majority- 
rule transformation on 2 X 2 or 3 X 3 blocks, or for any block size in dimension (/ > 3, 
we feel that the obstacles are technical rather than fundamental. Indeed, the results 
in Section 4 suggest that non-Gibbsianness may be the normal situation for RG maps 
at low temperature and/or near a first-order phase transition. 

The reader will probably not be surprised that the decimation transformation is 
"pathological": this transformation, unlike other RG transformations, does not in 
any sense integrate out the "high-momentum modes" and leave the "low-momentum 
modes"; it merely integrates out one sublattice and leaves another. In particular, if 
the sublattice of internal (integrated-out) spins is connected^ it is hardly surprising 
that the internal-spin system can exhibit a phase transition, and that this can give 
rise to RG pathologies. We therefore want to stress that the same pathology — non- 
Gibbsianness after one renormalization step — is also present at low temperature for at 
least some Kadanoff, majority-rule and block-averaging transformations. These latter 
transformations do (at least seemingly) integrate out the "high-momentum modes" 
and leave the "low-momentum modes", and they have been generally considered to 
be well-behaved. Indeed, nearly all real-space RG studies of Ising models have used 
some variant of these transformations. It is thus a highly non-trivial fact that these 
RG maps can be ill-defined at low temperature. 

2 Infinite- Volume Lattice Systems: General For- 
malism 

Consider a classical statistical-mechanical system with configuration space 0, Hamil- 
tonian H and a priori measure //q- The Boltzmann-Gibbs distribution fisG for this 
system in the canonical ensemble at inverse temperature /3 can be characterized in 
either of two ways: 

(a) Explicit formula. 

dfiBoioj) = Z-' e-^^'^-Ufioioj) , (2.1) 
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where of course 

Z = I e-f^^^^Ufio{oj) . (2.2) 



(b) Variational principle. fisG is that probabihty measure which maximizes entropy 
minus /3 times mean energy: 



where 



and 



fJ-BG maximizes S{fi\fio) — l3E[H^fi) , (2-3) 

5'(/i|/io) = -jilog — jd^ 

= -/^log^)*„ (2.4) 

E{H,fi) = {H), = fH{u)dfi{u). (2.5) 



The equivalence of these two characterizations is a simple computation in the calculus 
of variations. 

Unfortunately, this elementary theory does not apply to infinite-volume systems, 
because the Hamiltonian H{lo) is ill-defined: for almost any configuration u we have 
H[(jj) = iboo. Nevertheless, non-trivial analogues of these two characterizations can 
be developed for infinite- volume systems. The analogue of the explicit formula is 
the theory of specifications and Gibbs measures: an infinite- volume Gibbs measure is 
one whose conditional probabilities for finite subsystems are given by the Boltzmann- 
Gibbs formula. The analogue of the variational approach is the theory of equilibrium 
measures: an equilibrium measure is a translation-invariant measure that maximizes 
entropy density minus fi times mean energy density. These approaches are reviewed 
in Sections 2.3 and 2.5-2.6, respectively. The fundamental feature of infinite-volume 
systems, which distinguishes them from finite- volume systems, is that the map between 
"Hamiltonians" (more precisely, interactions) and Gibbs measures (or equilibrium mea- 
sures) is neither single-valued nor onto: some interactions have multiple Gibbs mea- 
sures, while some measures are not Gibbsian for any interaction. These facts are at 
the heart of the theory of phase transitions, and of the renormalization group. 

The standard references for the material in this section are the books of Georgii 
[157], Preston [299] and Israel [206]. Georgii and Preston deal principally with the 
theory of Gibbs measures, while Israel deals principally with the theory of equilibrium 
measures. 

We assume in this section that the reader has some knowledge of metric spaces and 
Banach spaces, ideally at the level of Roy den [309] or Reed and Simon [306], and of 
measure theory and probability theory, ideally at the level of Bauer [25] or Krickeberg 
[224]. However, we realize that for many readers these theories belong to only faintly 
remembered mathematics courses and are rather distant from their day-to-day work 
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in theoretical physics. Nevertheless, we urge such readers not to be discouraged by the 
abstract jargon, and to use the examples we provide as a means to grasp the essential 
physical ideas underlying the mathematics. 

In this section the emphasis is on concepts and ideas (both physical and mathe- 
matical), not on techniques of proof. Therefore, all definitions and theorems are stated 
precisely, but proofs are omitted. In Appendix A we provide, for each theorem, either 
a published reference (if the result is known) or a proof (if it is new). 

Henceforth we absorb /3 into the Hamiltonian H; this simplifies the notation. Let us 
also remark that although our exposition is couched in the language of the canonical 
ensemble, the formalism is equally applicable to the grand canonical ensemble: it 
suffices to interpret our H to mean "/3i7 — l3fiN^\ In fact, this formalism applies 
to an arbitrary "generalized (grand) canonical ensemble" with parameters /3i,...,/3„ 
conjugate to observables i^i, . . . , Hn- 

2.1 Configurations, Events, Functions, Measures 

8 

Classical statistical mechanics is a branch of probability theory. The basic struc- 
tures of probability theory are: 

• A configuration space — this is the set of all possible (microscopic) configura- 
tions of the system under study. 

• A cr-field JF of subsets of — this is the set of all events (= yes-no questions) that 
are measurable by some conceivable (possibly extremely idealized) experiment. 
Various sub-cr-fields A C J-" may correspond to restricted classes of experiments 
(e.g. experiments performed within a specified region of space). 

• Observables (= random variables = real- valued jF-measurable functions on ft) — 
these correspond to experiments which give a real number as an answer. Various 
subclasses of observables (e.g. those measurable with respect to a specified sub- 
cr-field A) may correspond to restricted classes of experiments (e.g. experiments 
performed within a specified region of space). 

• A probability measure (= probability distribution) fi on (0,JF) — this describes 
either our state of partial knowledge of the system (if we take a "subjective" inter- 
pretation of probability theory) or an ensemble of "identically prepared" random 
systems (if we take an "objective" interpretation of probability theory). The 
mathematics of statistical mechanics does not depend on any particular interpre- 
tation of its mathematical objects, so the reader is urged to employ whichever 
interpretation he/she prefers. 

In this section we describe the particular case of this structure that is appropriate for 
the equilibrium statistical mechanics of an infinite-volume classical lattice system. 



^A reference for this section is Georgii [157, Introduction and Sections 1.1 and 2.2]. 
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2.1.1 Configurations and Events 

The configuration space of an infinite- volume lattice system is specified by the following 
ingredients: 

• The single-spin space Oq- This is the space of possible configurations of the 
physical variable(s) at a single lattice site. (For brevity we call these variables a 
"spin".) Examples: Ising model, Oq = {"l?!}; iV-vector model, Oq = S^~'^ = 
unit sphere in R ; A^-component Gaussian or Lp'^ model, Oq = R ; solid-on-solid 
(SOS) or discrete Gaussian model, Oq = Z. 

Since statistical mechanics is based on probability theory, we shall always as- 
sume Oo to be equipped with a cr-field J-'q of "measurable sets". Usually Oq will 
also come equipped with a physically natural topology; in fact, Oq will almost 
always be a complete separable metric space, and J-'q will be the cr-field of Borel 
sets. If Oo is a compact metric space, we say that the system has bounded spins; 
otherwise we say that the system has unbounded spins. Examples: The Ising 
and A^-vector models have bounded spins; the Gaussian, (^'*, SOS and discrete 
Gaussian models have unbounded spins. 

• The lattice C — a countably infinite set of "sites". For the moment we need not 
give C any geometric structure, but for concreteness the reader can imagine C to 
be some J-dimensional lattice. 

The infinite- volume configuration space is then defined to be the Cartesian product 
(Oo)'^; that is, it is the set of all configurations uj = {ijJx)xec with uj^ £ Oq for each site 
X. The space is equipped with the product cr-field JF = (J^o)^ and with the product 
topology.^ The product topology means that a sequence (or net) of configurations (cu") 
converges to a configuration io if and only if cu" — )• cOx for all x ^ C. If Oq is metrizable 
(resp. separable, complete metric, compact), then so is ft. 

It is important to understand physically what the product topology means. Suppose 
for simplicity that Oq is a metric space. Then a typical neighborhood of cu G is the 
set 

J^Lo,e,A = W- dist{ujx,uj'^) < e for all x G A} , (2.6) 

where e > and A is a finite subset of £.^° That is, a typical neighborhood of io in the 
product topology is the set of configurations that are close to io on some finite set of 
sites A, but are arbitrary outside A. In particular, if Oq is discrete (as e.g. in the Ising 
model), then a neighborhood of io is the set of configurations that agree with u on some 
finite set of sites A, but are arbitrary outside A. These facts will play an important 
role in our discussion of non-Gibbsianness for RG image measures (Sections 4.1-4.3). 



^If ilo is a separable metric space, then the product cr-field of the individual Borel cr-fields coincides 
with the Borel cr-field for the product topology. 

"'^'^More precisely, the sets Wa),e,A form a neighborhood basts of ui, i.e. every neighborhood of ui 
contains one of the sets Afuj c A- 
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For each subset A C >C, we let Tk C JF be the sub-cr-field corresponding to events 
depending only on the spins cua = {ujx)x^k] that is, Tk is the cr-field of events measurable 
within the subset A. We denote by S the class of all nonempty finite subsets of C. We 
denote by A^ the complement of A in £. 

Remark. The Cartesian product (flo)^ is not the most general configuration space. 
Often one wishes to study a lattice model with local constraints (e.g. hard-core exclu- 
sions). One way (not the only one) to treat these constraints is to cut the excluded 
configurations out of the configuration space: that is, we let the configuration space 
be an appropriate subset of the product space (Oq)'^. We do not allow this much 
generality here, but much of the present theory goes through (with some modification) 
in this situation [313, 269, 15]. 

2.1.2 Functions (= Observables) 

An observable is simply a real-valued measurable function on ft. We consider various 
spaces of such functions: 

• The space B[ft) = BiJl^J^) of bounded measurable functions. This is the largest 
space of functions we shall consider. 

• The space Biodft) = U^e^ -^(^7 -^a) of bounded local functions. A function is 
local if it depends on only finitely many spins. 



• The space Bqi[i}) = Biodft) of bounded quasilocal functions. A function is 
quasilocal if it is the uniformly convergent limit of some sequence of local func- 
tions. Equivalently, a function is quasilocal if it "depends weakly on distant 
spins" in the sense that^^ 

lim sup \f{u)-f{u')\ = 0. (2.7) 

• The space C(0) of bounded continuous functions. 

• The space Ciodft) = Biodft) fl C(0) of bounded continuous local functions. 

• The space Cqi{ft) = Bqi[ft) fl C(0) of bounded continuous quasilocal functions. 

Examples. 1. For Oq = R, the function f{(fi) = sgn((^o) is bounded and local 
(hence quasilocal) but not continuous. Analogous functions can obviously be con- 
structed for Oo = S^~'^ or R , and indeed on any single-spin space which is not 
discrete. 



"'^"'^The statement lim F(A) = a (where a G R or C) means that for each e > 0, there exists a finite 

subset Kf C jC such that \F(A) — a\ < e whenever A Z) Kf. Statements lim_F(A) = -|-cxd or — cxd are 

to be interpreted analogously. (Mathematicians call this "convergence along the net of finite subsets 
of £, directed by inclusion".) Please do not confuse this notion of convergence with the limit in the 
sense of van Hove, to be defined in Section 2.4.1. 
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2. If Oo = 'C = Z, any (bounded) function of cr^j^ is (bounded and) continuous but 
not quasilocal. (This example, which was suggested to us by Hans-Otto Georgii, is 
further discussed in Appendix A.l.) 

We equip each of the above spaces with the "supremum norm" (or "uniform norm" ) 

11/11 = ll/IU ^ sup|/(c.)|. (2.8) 

So equipped, the spaces i?(0), i?g;(0), C(0) and Cqi{ft) are Banach spaces. Let us 
notice that: 

(a) If the single-spin space Oq is a compact metric space, then every continuous 
function is bounded and quasilocal. Hence C(0) = Cqi{ft) C Bqi[ft). 

(b) If the single-spin space Oq is discrete^ then every quasilocal function is continuous. 
In particular, i?g/(0) = Cg/(0) C C(0). 

(c) If the single-spin space Oq is finite^ then quasilocality and continuity are equiva- 
lent (and imply boundedness). Hence C(0) = Cqi{ft) = Bqi[ft). 

2.1.3 Measures 

Next we study measures on ft. Let Af (0) = Af (0, JF) be the space of finite signed mea- 
sures on 0, and Af_|_i(0) = Af_|_i(0,jF) C Af(0) be the space of probability measures. 
There is a natural duality between spaces of functions and spaces of measures, namely 

(/.,/) = /.(/) = jfdij^. (2.9) 

If is compact, then every bounded linear functional on C(0) arises in this way from 
a finite signed measure (Riesz-Markov theorem); otherwise put, the Banach-space dual 
of C(0) is exactly M(0). 

Let /i be a probability measure on 0; then the support of fi (denoted supp/i) is a 
closed subset of that can be defined in any of three equivalent ways: 

(a) The set of all cu G such that every neighborhood of io has nonzero measure. 

(b) The intersection of all closed sets of measure 1. 

(c) The complement of the union of all open sets of measure zero. 

The key theorem is: if is a separable metric space, then //(supp/i) = 1, so that supp/i 
is the smallest closed set having measure 1. 

We need to discuss what it means for a sequence (or net) of measures //„ to converge 
to a limiting measure //; in other words, we need to equip the spaces Af(0) and Af_|_i(0) 
with a topology. In fact, there are several mathematically natural topologies, each with 
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a distinct physical meaning. The simplest topology is the norm topology defined by 
the total variation norm 

||/i-z/|| = sup |/i(/) - z/(/)| 

l|/||oo<l 

sup |/i(/) - z/(/)| 
/ e c(fi) 

l|/||oo<l 

= 2 sup \fi{A) - v{A)\ . (2.10a) 

A sequence (or net) //„ converges in variation norm to // if ||/i„ — /i|| — )• 0. Physically, 
norm convergence of //„ to fi means that expectation values in //„ converge to those 
in /i, uniformly for all bounded observables /. This is an extremely strong notion of 
convergence, which occurs only rarely in physical applications. Therefore, we introduce 
also the weak topologies induced by the various classes of functions defined in Section 
2.1.2: 

• The bounded measurable topology: iJ^n ^ ij/ii fin{f) -^ /"(/) ^oi" ^-U / ^ BiJl^J^). 
[If the /i„ are probability measures, it suffices to check that /i„(A) — )• /u(A) for all 

• The bounded quasilocal topology: iJ^n ^ ij/ii fin{f) -^ /"(/) ^oi" ^-U / ^ BqiiJl^J^). 
[If the /i„ are probability measures, it suffices to check convergence for / G 

Bioc{^,J^), or alternatively for all A G U -^A-] 

Ae5 

• The (ordinary) weak topology: iJ^n ^ ij/ii fin{f) -^ /"(/) ^oi" ^-U / ^ C(0). 

• The weak quasilocal topology: fin ^^ f- H f'n{f) -^ /"(/) ^oi" ^-U / ^ Cqi{ft^J-'). 
[If the fin are probability measures, it suffices to check convergence for / G 

We emphasize that the convergence is required to occur for each observable / in the 
designated class, but the convergence is not required to be uniform in /. This is 
important, since / could equally well be the local energy density in New York or the 
local energy density on Andromeda; and one should not expect, in most situations, the 
convergence to be uniform on all such observables. This reasoning also suggests that 
the two quasilocal topologies are likely to be the ones of greatest physical relevance. 

Examples. 1. Let Oq = R, and let fin (resp. fi) be the Dirac delta measure concen- 
trated on the configuration in which all the spins take the value 1/n (resp. 0). Then 
fJ'n^f'^n the weak and weak quasilocal topologies, but not in the bounded measurable 
or bounded quasilocal topologies. 

2. Let Oo = {"I?!}? and let //„ be the Dirac delta measure concentrated on the 
configuration which is +1 for all spins at a distance < n from the origin and —1 for 
all other spins. Let // be the Dirac delta measure concentrated on the configuration 
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which is all +1. Then //„—)•// in the bounded quasilocal, weak and weak quasilocal 
topologies, but not in the bounded measurable topology. 

Georgii bases his theory on the bounded quasilocal topology (which he calls the 
"topology of local convergence" or the "^-topology") [157, Chapter 4]; Israel restricts 
attention to compact metric single-spin spaces, and uses mainly the weak (= weak 
quasilocal) topology [206, Chapters II and IV]. 

Finally, let us remark that with respect to the (ordinary) weak topology, Af_|_i(0) is 
separable and metrizable (resp. complete metrizable, compact metrizable) if and only 
if is. Let us also remark that if Oq is separable and metrizable (resp. countable and 
discrete), then the bounded quasilocal topology is stronger than (resp. equal to) the 
(ordinary) weak topology; this is true even though these hypotheses do not imply that 
C(0) C B,i{ft). 

2.2 Interactions and Hamiltonians 

12 

As discussed in the Introduction to this section, the Hamiltonian H{lo) for an 
infinite-volume system is an ill-defined object. Therefore we must proceed more cau- 
tiously. We define first the concept of an interaction, which corresponds roughly to the 
idea of a "formal Hamiltonian" or a "set of coupling constants". Then we define the 
finite-volume Hamiltonians corresponding to a given interaction and given boundary 
conditions. 

The (meaningless) Hamiltonian of an infinite- volume system is written formally as 
a sum of terms corresponding to various finite subsets of the lattice: one-body terms, 
two-body terms, three-body terms and so forth. Mathematically this idea is made 
precise as follows: 

Definition 2.1 An interaction (or interaction potential or potential^ is a family $ = 
{^A)Aes of functions $^: — )• R, such that for each A (^ S , the function $^ is 
J-'A-m.easurable (i.e. depends only on the spins in the finite subset A). 

Remark. Note that we do not allow the interaction $^ to take the value -|-oo. 
Therefore, a "hard-core interaction" is not included in our formulation. 



Example. Consider the Ising model whose formal (i.e. meaningless) Hamiltonian 



IS 



2_^ JxyiOxiOy — 2_^hxiOx ■ (2-11) 

(xy) ^ 

This model is defined (meaningfully!) by the interaction 

{—hxiOx if A = {x} 

-J^yUJ^ujy ifA = {x,y} (2.12) 

otherwise 



^^References for this section are Georgii [157, Section 2.1] and Israel [206, Sections I.l and 1.2]. 
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The next step is to define tlie Hamiltonian Hf corresponding to an interaction $ 
acting in a finite volume A. Tliis depends, liowever, on wliat boundary conditions one 
cliooses. Tlie simplest case is free boundary conditions: 



Definition 2.2 Let $ be an interaction. Then, for each A ^ S, the Hamiltonian 

'A,/ree 



Hf, /^„„ for volume A with free boundary conditions is the function 



H 



A, free 



E 

Aes 

A C A 



$, 



(2.13) 



Note that this is always a finite sum, so the free-b.c. Hamiltonian is always well-defined. 
Note also that Hf j^^^ depends only on the spins inside A. 

Free boundary conditions are not, however, sufficient: for many purposes we need 
Hamiltonians in which the interior of the volume A is allowed to interact with the 
exterior. To do this, we must consider the bonds that couple a given volume A with 
its exterior; these give a contribution of the form 



W, 



A,A= 



E 



$, 



(2.14) 



Aes 
AnA^ 
AnA^ ^ 



Note that now we are dealing with an infinite sum; therefore we must be careful about 
its convergence. In any case, the Hamiltonian for volume A with general external 
boundary conditions corresponds to adding the contributions (2.13) and (2.14): 

Definition 2.3 Let $ be an interaction. Then, for each A G iS, the Hamiltonian Hf 
for volume A with general external boundary conditions is the function 



m 



[u\ 



E 

Aes 
AnA ^ 



$^(cu) 



H 



A, free 



[u) + W, 



A,A^ 



(iO 



(2.15a) 



(2.15b) 



provided that this sum. converges to a finite limit for all cu G 0, in which case we call 
the interaction $ convergent.^"^ 

Here the convergence is not required to be absolute, nor is it required to be uniform 
in cu; we insist only that the finite- volume Hamiltonian Hf{uj) = H 



a(^a,^a=. 



be well- 



defined for all configurations to (i.e. all pairs consisting of an internal configuration 

/ \ 



^More precisely, what this means is that the net 



limit (for each w G il) as A f £. 



E 

Aes 
A n A 7^ 

A C A 



$^(w) 



converges to a finite 



A6<S 
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cua and an external configuration loa^)- Tliis is a very modest requirement. It rules 
out, however, the use of this formalism for a Coulomb system, in which the interaction 
decays too slowly to be summable in any reasonable sense. ^"^ 

For many purposes it is convenient to think of the configuration outside A as fixed 
(the "boundary condition") and the configuration inside A as variable. Therefore, for 
any fixed r G 0, we define the Hamiltonian Hf^ which uses boundary condition r 
outside the volume A to be 

Hl^{u) = HfiuA X TAc) . (2.16) 

Here cua x tac is the configuration which agrees with cu on A and with r on A^. Note 
that Hf^{uj) depends only on the behavior of uj inside A. 

It is also possible to define the Hamiltonian with other boundary conditions (e.g. 
periodic), but we shall have no need for these. 

The summability properties of the Hamiltonian (2.15a) have important implications 
for the characteristics of the measures constructed with them. In addition to the notion 
of convergence introduced in Definition 2.3 above, we wish to distinguish two stronger 
notions of summability: 

Definition 2.4 We call the interaction $ 

• uniformly convergent if, for every A ^ S, the sum (2.15a) converges uniformly 

in Lo; 

• absolutely summable if, for every A G iS, the sum (2.15a) converges in B[ft) 

norm.. This is equivalent to the condition that Y, ||^a||oo < oo for each 

A es 

A 3 i 

I G C. 

Obviously, absolutely summable implies uniformly convergent, which in turn implies 
the convergence of (2.15a). Some comments and examples are in order: 

1) The physical interpretation of absolute summability is roughly that the maximum 
interaction energy between one spin and the rest of the universe is finite. Alternatively, 
the hipping of one spin produces always a bounded change in energy. 



^■*Lattice Coulomb systems admit a partial thermodynamic treatment based on free boundary con- 
ditions and a carefully taken infinite- volume limit (ensuring overall neutrality of the plasma) [136, and 
references therein]. It may be possible to cast that theory into a generalized version of the Gibbs-DLR 
framework in which the "bad" external configurations — here the non-neutral ones — are made inac- 
cessible to all Gibbs measures [299, pp. 16-18 and Chapter 6]. If so, many (but not all) of the results 
discussed in this section would be valid also for such systems. The case of gravitational systems is 
even worse, because there is no condition analogous to neutrality that can be enforced. These systems 
are not even thermodynamically stable. 
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2) An example of a uniformly convergent interaction which is not absolutely summable 
is the following one-dimensional Ising model [336]: 



$^(cu) 



( — l)"c„ if A is a non-empty set of n adjacent points 

and ujx = +1 for all x G A /q i 7^ 

otherwise 



for a suitable sequence of non-negative numbers (c„)„>i. If nc„ J, 0, this interaction is 
uniformly convergent; but it is not absolutely summable unless Xl™ ^c„ < oo. Thus, 
Cn = n~" with 1 < a < 2 provides the desired counterexample. (In fact, if Xln ^^ = 00 
— for example, c„ = l/nlog(n + 1) — the interaction does not even belong to the 
largest space of interactions considered in the usual thermodynamic formalism, namely 
the space B^ introduced in Section 2.4.4 below.) 

3) Interactions can also be classified according to the maximum spatial distance 
over which they extend: the range of $ is defined to be the supremum of the diameters 
of the sets A with $^ ^ 0. Thus, an interaction $ is of finite range R [R < 00) if 
^A = whenever diam(A) > R. For a finite-range interaction, the sum (2.15a) is a 
finite sum, so $ is (trivially) a uniformly convergent interaction. If, in addition, each 
^A is a bounded function, then $ is absolutely summable. 

4) Let us now introduce two natural pieces of terminology: First, we shall call an 
interaction $ bounded if each $^ is a bounded function. Note that if $ is bounded 
(resp. absolutely summable), then each Hamiltonian Hf .^^ (resp. Hf) is a bounded 
local (resp. bounded quasilocal) function. A bounded interaction, however, may fail to 
be absolutely summable if the bounds ||$a||oo do not decay fast enough. 

5) Second: if, as is usual, the space Oq (and hence ft) comes equipped with a 
topology, then we call an interaction $ continuous if each $^ is a continuous function. 
Note that if $ is continuous (resp. continuous and uniformly convergent), then each 
Hamiltonian Hf j^^^ (resp. Hf) is a continuous function. All the interactions consid- 
ered in this work (and an overwhelming majority of those considered elsewhere) are 
continuous. 

6) If Oo (and hence ft) is compact^ then every continuous interaction is automatically 
bounded. This is one reason why systems of bounded spins are easier to work with 
than systems of unbounded spins. 

7) Nevertheless, as we discuss later (Section 2.3.5), all the properties of an inter- 
action must be interpreted modulo physical equivalence. In this regard, the apparent 
summability properties may turn out to be misleading, as they may change widely 
from one physically equivalent interaction to another [351]. 

2.3 Specifications and Gibbs Measures 

15 



^^References for this section are Georgii [157, Chapters 1-4] and Preston [299, Chapters 1, 2 and 

5]. 
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We now come to the heart of the theory of infinite- volume lattice systems, which is 
to make precise what we mean by an infinite-volume Gibbs measure for a given inter- 
action $. We cannot simply use the explicit formula (2.1), because the infinite-volume 
Hamiltonian H is ill-defined. The traditional solution is to define an infinite-volume 
Gibbs measure to be any measure which is a limit (in a suitable topology) of finite- 
volume Gibbs measures with some chosen boundary conditions. The disadvantage of 
this definition is that it is cumbersome to check: given a measure fi on the infinite- 
volume configuration space, how do we determine whether there exists some sequence 
of finite- volume Gibbs measures converging to //? We would prefer, therefore, to have 
a more direct condition on the infinite- volume measure //. Such a condition was first 
proposed by Dobrushin [83] and Lanford and Ruelle [232]: their idea is to define an 
infinite- volume Gibbs measure to be one whose conditional probabilities for finite sub- 
systems A, conditioned on the configuration outside A, are given by the Boltzmann- 
Gibbs formula based on the Hamiltonian Hf. This is the approach we shall take; the 
traditional approach can then be justified a posteriori (Propositions 2.22 and 2.23). 

Let us note that, in general, we must condition on the configuration in the entire 
exterior of A — that is, we must specify a complete "external condition". However, in 
the special case of a nearest-neighbor interaction (resp. an interaction of finite range 
i?), it suffices to specify the spins immediately adjacent to A (resp. the spins at a 
distance < R from A) — hence the term "boundary condition". We shall usually 
bow to tradition and call our external configurations "boundary conditions", but we 
emphasize that in the general case of an infinite-range interaction it is essential to 
specify the configuration in the entire exterior region. 

Let us also remind the reader of the physical role played by boundary conditions: 
in infinite volume the Gibbs measure (to be defined shortly) may not be unique, and 
the boundary conditions serve to select a particular Gibbs measure (i.e. a particular 
"phase"). All this will be described in greater detail in what follows. 

2.3.1 Specifications 



■} 



We begin by formalizing the idea of "conditioning on the exterior of a volume A 
irrespective of any particular formula for these conditional probabilities. The point 
is that for a given external configuration cuac, we wish to specify the (conditional) 
probability distribution of the spins inside the volume A: that is, we want to specify 
Vvoht_^^^{dLOf^). Such an object is called a probability kernel^^ . In general, a probability 
kernel tt from a space (0,JF) to another space (0',JF') is an object 7r(c<;,A) with two 
"slots": an "input" slot that accepts an input configuration cu G 0, and an "output" 
slot that accepts a set A ^ J-"' and returns its probability. More formally, a probability 
kernel from (0, JF) to (0', JF') is a map tt: X JF' — )• [0, 1] satisfying: 

(a) For each fixed cu G 0, 7r(c<;, • ) is a probability measure on (0', JF'). 



^®For a more extensive introduction to probability kernels and their properties, see [25, Section 56] 
or [272, Section III-2]. 
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(b) For each fixed A G JF', 7r( • , A) is a jF-measurable function on ft. 

We sliaii write sucli a probabiiity Icernei equivaientiy as 7r(c<;,A) = 7r(A|c<;) = Tr^^[A). 
Tlie first notation empliasizes tliat tt is a Icind of "transition probabiiity" (as in tlie 
tlieory of Markov processes); tlie second notation empliasizes that tt will later be in- 
terpreted as a conditional probability; and the third notation emphasizes that cu is a 
parameter ("boundary condition") indexing the probability measure on ft'. 

Thus, in our case we need to specify a probability kernel tta from (Oa=,^a=) to 
(Oa,^a)- For technical reasons, however, it is convenient to define tta instead as a 
probability kernel from the full space (0,JF) to itself: we then impose explicitly the 
condition that 7r(c<;, • ) depend on io only through its components cuac (i.e. it is J-'a^- 
measurable), and that it reproduce the "boundary condition" cuac when the question 
fed into its second slot concerns only spins outside A (i.e. when A G J^k^). We are thus 
led to the following definition^^: 

Definition 2.5 A specification^® is a family 11 = (7rA)Ae5 of probability kernels from 
(0,JF) to itself, satisfying the following conditions: 

(a) For each A ^ T , the function 7rA( • , A) is T k'^^ -measurable. 

(b) tta is J-'^c-proper, i.e. for each B G J^k^, TTt^{io,B) = xb{'^)- 

(c) If A C A', then tta'TTa = tta'."^^ 

Physically, the idea is that 7rA(cu, • ) is the equilibrium probability distribution for 
volume A subject to the boundary condition io outside A. Condition (a) states that 
this measure depends, in fact, only on the behavior of io outside A. Condition (b) 
states that for observations outside A, this measure equals the delta measure (5^^, i.e. 



"'^^See [299, Section 1] for a more leisurely discussion of these points. 

"'^^In some mathematical-physics literature (e.g. [123]) the term "local specification" is used. We 
emphasize that this adjective "local" is superfiuous; the concepts discussed here and in [123] are 
identical. In particular, the reader should not confuse this (redundant) use of the word "local" with 
our concept of "quasilocal specification" to be introduced in Section 2.3.3. 

^^The product of two probability kernels is a probability kernel: 

(7ri7r2)(w, A) = Tri{ui,dui')Tr2{ui',A) . 

For future reference we also define two ways of multiplying a measure by a probability kernel: 

(//7r)(A) = Li(duj)TT(uj,A) 

{jj. X tt){B) = fi(du))'iT(u),du)')xB('^ X '^') 

where A ^ !F and B ^ T x T' . Thus, // x tt is a probability measure on the product space (il x il', JF x 
T'), while jjlt: is its marginal on the second space {Q.' ,T'). 
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it reproduces the boundary condition. Condition (c) is a compatibility condition for 
pairs of volumes A C A': it states that if a volume A' is in equilibrium with its exterior, 
then all subsets of A' are in equilibrium with their exteriors. 

Definition 2.6 A probability measure fi on ft is said to be consistent with the spec- 
ification n = (7rA)Ae5 if iis conditional probabilities for finite subsystems are given by 
the {TTK)kes- that is, 

For each A G iS and A ^ T ^ £'^(xa|^a=) = t^ k^ ■ , ^) fi-a.e. (2.18) 

We denote by Q{Ii) the set of all measures consistent with 11. 

The following proposition gives two apparently weaker, but in fact equivalent, for- 
mulations of the condition (2.18): 

Proposition 2.7 Let 11 = (7rA)Ae5 be a specification, let fi be a probability measure on 
fl, and let A ^ S . Then the following are equivalent: 

(a) For each A^ T , £'^(xa|^a=) = t^k^ ■ , ^) ti-a.e. 

(b) There exists a measure z/a such that fi = v^t^k- 

(c) fi = fiTTA. 

Physically, (b) states that fi is the equilibrium probability distribution for volume A 
with some (possibly stochastic) boundary condition z/a, while (c) states that fi can 
itself play the role of z/a- 

Let us note that ^(H), the set of all measures consistent with H, is a convex set: if 
fii^ . . . ^ fin belong to ^(H), then so does any convex combination of them. The physical 
interpretation of such convex combinations, and of the extremal points of ^(H), will 
be discussed in Section 2.3.6. 

We also make the (trivial) remark that if the lattice C were finite^ then there would 
be a unique measure consistent with H, namely the measure 7r£(c<;, • ) which must be 
independent of io. This is one aspect of the fact that phase transitions cannot occur in 
finite systems. 

2.3.2 Gibbsian Specifications and Gibbs Measures 

An important example of a specification is the Gibbsian specification H* = (vrf )a£5 
corresponding to a given interaction $. More precisely, let $ be a convergent inter- 
action, so that we can define the Hamiltonians Hf with general external boundary 
conditions. Let /U° = Yl fi^ he a probability measure, called the a priori measure. We 

then define the conditional partition function 

Z^(cuAe) = f exp[-Ht{u)] n^/^°K)- (2.19) 

•^ xeA 
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[Note that the Hamiltonian Hf{uj) depends on both the spins cua inside A and on the 
"boundary conditions" loa^- After integrating out the spins cua, we obtain a function of 
cuac] Since Hf is everywhere finite, it follows that Zf{ujA'') > for all uj. If moreover 
-^a(^a=) < +00 for all A G iS and all cu G 0, we say that the interaction $ is /i°- 
admissible. Note in particular that if each Hf is bounded below — which certainly 
occurs if $ is absolutely summable, since this makes each Hf bounded — then $ 
is automatically //^-admissible. Also, if the single-spin space Oq is finite, then every 
convergent interaction is automatically //^-admissible [because the integral (2.19) is 
then a finite sum of finite terms]. 

Definition 2.8 Let /i° = Yl fJ-x be a probability measure, and let ^ be a convergent, 
1^^ -admissible interaction. Then the probability measure vrf(c<;, ■) on J^ defined by 

7r^(cu,A) = ZtiiOAc)-' IxaH eM-HlH] ^ J/.°K) (2.20) 

is called the Gibbs distribution in volume A with boundary condition cuac corresponding 
to the interaction $ and the a priori measure iJ^ . 

It is straightforward to verify that the family 11* = (vrf )a£5 is indeed a specification; 
it is called the Gibbsian specification for $ (and /i°). A measure consistent with 11* 
is called a Gibbs measure for $ (and /i°). By Proposition 2.7, a measure // is a Gibbs 
measure for 11* if and only if //vrf = ji for all A, i.e. 

JdiJ.{T)Zt{TAc)-' JxAiiOA X TAc) exp[-Ht{iOA X tac)] n dp'M = /i(A) (2.21) 

xEA 

for all A G JF and all A G iS. The equation (2.21) is called the Dobrushin-Lanford- 
Ruelle (DLR) equation. A slightly simpler equation is obtained by restricting A to 



J/iA . ^ 



I diJ.{T) Zf{TAc)-' exp[-Hf{iOA X tac)] /i°-a.e. (2.22) 



In general (2.22) is weaker than (2.21); the former is a necessary but not sufficient 
condition for // to be a Gibbs measure for 11*. However, in nearly all practical situations 
the two conditions are equivalent: see Remark 3 at the end of Section 2.3.3 below. 

At this point the reader may be wondering: Why have we bothered to introduce the 
very general (and abstract) concept of a specification, when virtually all of the concrete 
models studied in statistical mechanics correspond to Gibbsian specifications? We have 
two reasons: Firstly, non-Gibbsian specifications must be employed in some interesting 
statistical-mechanical problems, notably those involving hard-core exclusions (which 
we do not consider in this paper) or zero temperature (Appendix B.2.1). But perhaps 
more importantly, we want to be consistent with the underlying message of this work, 
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which is that not everything in the world is Gibbsian. Therefore, we must introduce 
concepts which are general enough so that the problems we wish to study will not have 
been excluded simply by definition. Having done so, we will then be able to investigate, 
without a priori preconceptions, which problems give rise to Gibbsian specifications 
and which ones do not. 

2.3.3 Quasilocality 

In all theoretical physics, a fundamental role is played by the concept of an "isolated 
system". A completely isolated system is of course an idealization, but one can in 
general render a system as close to isolated as desired by moving it a large distance 
away from all other objects. (Here we neglect cosmological effects, as well as couplings 
to fields that could carry off radiation.) This asymptotic isolation is possible, of course, 
because the interaction potentials decay to zero as the spatial separation tends to 
infinity. One can even argue that this decay of interactions is an essential precondition 
for the possibility of doing science: without it, the results of experiments on Earth 
would depend sensitively on conditions on Andromeda, and the repeatability that is 
fundamental to the scientific method would be absent. 

These remarks justify the introduction of a class of specifications that will play a 
central role in the remainder of this paper: 

Definition 2.9 A specification H = (7rA)Ae5 is said to be quasilocal if, for each A ^ S , 
f G Bqi[ft) implies tta/ G Bqi[ft). [Equivalently: f G Bioc{^) implies tta/ G Bqiijt).] 

Note that (7rA/)(cu) = f tt {^{u , du' ) f [u' ) is the mean value of / in the equilibrium 
probability distribution for volume A with boundary condition Uf^c Therefore, a speci- 
fication is quasilocal if the mean values of (quasi)local observables depend very weakly 
on the external spins far from A (e.g. outside a very large volume A') when the external 
spins in the intermediate region A' \ A are fixed, i.e. 



Jim sup |(7rA/)(cui)-(7rA/)(cu2)| = (2.23) 

(^i)a' = (^2)a' 



for all / G Bqiijt) [or i?;oc(f^)]-^° We emphasize that (2.23) constrains only the direct 
infiuence of the spins outside A' (since the spins in the "annulus" A' \ A are fixed). In 
particular, (2.23) is perfectly compatible with the occurrence of long-range order: it 
says merely that any infiuence on A from the spins outside A' has to be transmitted by 
the intermediate region. We emphasize also that this condition of "weak dependence" 
is formulated in the supremum norm, i.e. it is a "worst-case" condition. 



^"^If the state space ilo is finite, it suffices to check (2.23) for / G BiQ, J^a), because any / G Biod^) 
[say, / G 5(il, !F~) for some A Z) A] corresponds to finitely many different functions in 5(il, !F\) when 
one fixes the configuration i^T\ . • If the state space ilo is infinite, we do not know whether or not this 

weaker condition is equivalent to (2.23). 
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Examples. 1. If all the Hamiltonians Hf are local functions, then 11* is a quasilocal 
specification. This occurs, in particular, if $ is a finite-range (and //^-admissible) 
interaction. 

2. If all the Hamiltonians Hf are quasilocal functions, then 11* is a quasilocal spec- 
ification. This occurs, in particular, if $ is a uniformly convergent (and //^-admissible) 
interaction. 

3. Although we have not shown explicitly here how to treat models with constraints 
(e.g. hard-core exclusions), it is easy to see that local constraints do not disrupt quasilo- 
cality. 

Examples 1 and 3 cover all reasonable systems (of either bounded or unbounded 
spins) with finite-range interactions. Examples 2 and 3 cover all reasonable systems of 
bounded spins. Therefore, we argue that all systems of physical interest are quasilocal 
with the exception of models of unbounded spins with infinite-range interactions. These 
latter systems are, unfortunately, usually not quasilocal: 

4. Consider a model of real-valued spins {^i} — for example, a Gaussian or ip"^ 
model — with formal Hamiltonian 

H = -Y.J^m'^3 (2.24) 

where J has infinite range. Then the resulting specification is not quasilocal, because 
an external spin arbitrarily far away from the volume A can, by taking extremely large 
values, have large effects inside A. The trouble here is that quasilocality is defined 
in the supremum norm, which is too strong a condition for systems with unbounded 
Hamiltonians. (There is in fact a more serious difficulty in this example: for some 
external conditions the Hamiltonian i7* is divergent. Therefore, to treat these systems 
it is necessary to enlarge slightly the concept of specification in order to allow some 
external conditions to be "forbidden" [299, pp. 16-18 and 89] [245, 60], or else to play 
some minor trickery [157, pp. 261, 264-265 and 424-425].) 

We summarize the main conclusion from this discussion: 

Theorem 2.10 Let ^ he a uniformly convergent and iJ^ -admissible interaction. [In 
particular this happens if $ is absolutely summable, or if $ is finite-range and iJ^- 
admissible.J Then the specification H* is quasilocal. 

The Gibbsian specification arising from a model with finite (resp. bounded) Hamil- 
tonians has an additional characteristic property: 

Definition 2.11 A specification H = (7rA)Ae5 is said to be 

• nonnuU (with respect to /i°) if, for each A G iS and each A G J^a, 

//0(A) > =^ 7rA{Lo,A)>0 for allio en . (2.25) 
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• uniformly nonnuU (with respect to /i°) if, for each A ^ S , there exist constants 
< ctA ^ /3a < oo such that 

«a/(A) < TTAiio.A) < /3a /(A) (2.26) 

for all id ^ ft and all A G J-'a- 

Roughly speaking, "nonnuU" means that there are no hard-core exclusions, while "uni- 
formly nonnuU" means that moreover the finite- volume Hamiltonians are bounded (as 
a function of both the interior spins cua and the exterior spins loa^)- 

It turns out that the twin properties of being quasilocal and uniformly nonnuU 
exactly characterize the Gibbsian specifications for absolutely summable interactions: 

Theorem 2.12 (Gibbs representation) Let H be a specification, and let /i° be a 

product measure. Then the following are equivalent: 

(a) There exists an absolutely summable interaction $ such that 11 is the Gibbsian 
specification for $ and /i°. 

(b) n is quasilocal and is uniformly nonnuU with respect to iJ^ . 
Moreover, if the single-spin space Oq is finite, then these are also equivalent to 

(c) n is quasilocal and is nonnuU with respect to jjf^ . 

The proof that (a) ==;' (b) is easy; the nontrivial proof that (b) ==;' (a) is due to 
Kozlov [222]. The observation that (c) ==;' (b) for finite single-spin space was made 
by both Sullivan [336] and Kozlov [222]. 

Definition 2.13 A measure fi on ft is said to be quasilocal if there exists a quasilocal 
specification with which fi is consistent. (Sullivan [336] uses the term. "almo.st Marko- 
vian" in place of "quasilocal".) 

Theorem 2.12 implies that quasilocality is only slightly more general than Gibb- 
sianness for an absolutely summable interaction: roughly speaking, quasilocality allows 
for local constraints (e.g. hard-core exclusions) while Gibbsianness does not. 

Remarks. 1. For further discussion on the Gibbs representation theorem, in 
connection with translation invariance, see the Remark at the end of Section 2.4.9. 

2. Sullivan [336] and Gross [178, pp. 194-195] have introduced a slightly larger 
class of interactions than those considered here, based on the observation that the 
only energies which play a role in the definition of the specification 11 are the relative 
energies of pairs of configurations that differ at only finitely many .sites. Therefore, it 
is not necessary for the Hamiltonians 

Hl{u)= Y: ^4^) (2.27) 

Aes 
AnA^0 
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to be well-defined, but only the relative Hamiltonians 

HtiA^.uj') = Y: [^4^) - $4^')] (2.28) 

Aes 
AnA^0 

for configurations lo^uj' that agree outside A. It turns out [336, Proposition 3] that 
for interactions whose relative Hamiltonians are uniformly convergent (Sullivan calls 
these interactions "/^-convergent"), the corresponding specification is again quasilocal 
and nonnuU (at least for finite single-spin space). So this generalization does not 
provide examples of physically interesting non-quasilocal specifications. Indeed, we 
can combine this result with (c) =^ (a) of Theorem 2.12, and conclude that for any 
"relatively uniformly convergent" interaction $ (at least on a finite single-spin space) 
there is an absolutely summahle interaction $' such that 11* = 11* . Roughly speaking 
this means that $ and $' are "physically equivalent" (see Section 2.3.5). 

3. If n is a quasilocal specification, then the criterion for // to be consistent with 11 
can be weakened slightly: instead of requiring // = //tta [Proposition 2.7(c)], it suffices 
to have // = //tta on the a -field Tk [157, Remark 4.21]. Thus, if $ is a uniformly 
convergent (and //^-admissible) interaction, then the alternate DLR equation (2.22) is 
equivalent to the standard DLR equation (2.21). 

4. In rather great generality it can be proven [161, 300, 330] that every measure // 
is consistent with some specification. However, this specification will in general not be 
quasilocal. Indeed, in Section 4 we shall give numerous examples of measures that are 
not consistent with any quasilocal specification. 

2.3.4 Feller Property 

It is useful to single out a class of specifications in which the finite-volume Gibbs 
measure 7rA(cu, • ) depends in a "sufficiently continuous" way on the boundary condition 

Definition 2.14 A specification 11 = (7rA)A£5 is said to be Feller if, for each A ^ S, 
f G C(0) implies tta/ G C(0). 

Example. If the interaction $ is continuous and uniformly convergent (and /in- 
admissible), then the specification 11* is Feller. Thus, nearly all specifications of phys- 
ical interest are Feller. 

It is worth remarking that the definition of the Feller property formally resembles 
that of quasilocality: indeed. Definition 2.14 is identical to Definition 2.9, with Bqi[ft) 
replaced everywhere by C(0). In particular, if the single-spin space Oq is finite^ then 
Bqi[ft) = C(0), SO the concepts of "quasilocal specification" and "Feller specification" 
coincide. 

We can now state a very important uniqueness theorem: 
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Theorem 2.15 Let fi be a probability measure that gives nonzero measure to every 
open set U C 0.^^ Then there is at most one Feller specification with which fi is 
consistent. In particular, if the single-spin space Oq is finite, then there is at most one 
quasilocal specification with which fi is consistent. 

This theorem has an important consequence for the renormahzation group: it shows 
that the downward vertical arrow in (1.2) cannot be a multi-valued map, provided that 
we interpret H' as standing for a specification. 

Remark. Such uniqueness does not hold in general for non- Feller specifications. In- 
deed, if /ii, /i2, . . . is any finite or countably infinite set of probability measures that are 
distinguishable at infinity^^, there exists a specification (in general non-Feller and non- 
quasilocal) with which all these measures are consistent. ^"^ For example, let /ii,/i2, . . . 
be Gibbs measures of the two-dimensional Ising model at an arbitrary sequence of 
temperatures /3i,/32,... G [— oo,-|-oo]; then there exists a specification with which all 
these measures are consistent! (By Theorem 2.15, such a specification is of necessity 
non- Feller and non-quasilocal.) This remark shows that non-quasilocal specifications 
can be extremely pathological and "unphysical" ; it is an additional argument for the 
importance of quasilocality. 



2.3.5 Physical Equivalence in the DLR Sense 

The same physical situation can be described by many different interactions $. For 
example, the interactions 

r -hio, if A= {i} 

^A{io) = I - Jcu,M+i if A = {z, z + 1} (2.29) 

V otherwise 

and 

$^(cu) = j -f^. - -/^.M+i if A={i, 1 + 1} ^2.30) 



2 

otherwise 

both describe the one-dimensional Ising model with nearest-neighbor interaction J 
and magnetic field h; they are obviously "physically equivalent". The reason they 
are "physically equivalent" is that they define the same .specification — and it is the 
specification that determines the physics. 

Refiecting a little bit on this and similar examples, one comes to the following 
definition [157, Section 2.4]: 



^^This means, roughly speaking, that every configuration in il is "possible", i.e. there are no "hard- 
core exclusions" . 

^^This means that there exist disjoint sets FijF'j,... G .?^oo = -^A= such that fik(Fk) = 1 for 

A6<S 

each k. 

'^^Proof: Form the measure fi = ^j. c^fik, where ci,C2, . . . > is any sequence with sum 1. By 
[161, 300, 330] there exists a specification H with which fi is consistent. But then fij. = c^ Xf^I^ is 
also consistent with H [299, Lemma 2.4]. 
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Definition 2.16 Let $ and $' be convergent interactions. We say that $ and $' are 
physically equivalent in the DLR sense if, for all A G iS, the function H^^ — H^^ is 
J^f^c -measurable (i.e. depends only on the .spins outside AJ. 

One can then prove the following theorem: 

Tlieorem 2.17 Let $ and $' be convergent ji^ -admissible interactions. Consider the 
following .statements: 

(a) $ and $' are physically equivalent in the DLR sense. 

(b) n* = n* , i.e. the specifications for $ and $' coincide. 

Then (a) =^ (b). Moreover, if IJ^{U) > for every open set U C O,^"* and the 
interactions $ and $' are continuous, then (b) =^ (a). 

Corollary 2.18 (Griffiths— Ruelle) Let $ and $' be uniformly convergent, continu- 
ous, fi'^-adm.issible interactions; and as.sume that IJ^{U) > for every open set U C ft. 
If there exists a mea.sure fi that is Gibbsian for both $ and $', then $ and $' are 
physically equivalent in the DLR sense, and H* = H* [hence $ and $' have exactly 
the same Gibbs mea.sure.s] . 

There are several ways to deal with the ambiguity caused by physical equivalence. 
One way is to select a single "preferred" representative from each class of physically 
equivalent interactions: in the Ising model this is exemplified by the possibility of 
using "spin" interactions $^ = — J^cr^ or "lattice-gas" interactions $^ = —JaP^ = 
— Ja i^^') [206, 351]; and more generally it is exemplified by the concepts of "a- 
normalized" interactions and "gas" interactions [157, Sections 2.3 and 2.4]. However, 
for interactions which are not finite-range, this approach can give rise to convergence 
problems [351]. 

The other approach is to accept the ambiguity as inevitable, and to work with 
equivalence classes of interactions modulo physical equivalence. We shall take this 
latter approach. The key result here is Corollary 2.18, due originally (albeit in a 
very slightly weaker form) to Griffiths and Ruelle [174]. This result has an important 
consequence for the renormalization group: it shows that the downward vertical arrow 
in (1.2) cannot be a multi-valued map, provided that we interpret H' as standing for 
an equivalence class of interactions modulo physical equivalence. 

To avoid trivialities, we as.sume henceforth that IJ^{U) > for every open set U C ft. 



^■*This means, roughly speaking, that every configuration in il is "possible". If it were not so, then 
the true configuration space would be a proper closed subset F = flj;?/; ^uPPA'Oa; ^ ^- ^^ could 
then make the condition hold simply by redefining the configuration space to be F rather than il. So 
the condition means simply that the configuration space does not contain any "useless points" . Some 
such condition is needed for (b) ==?- (a) to hold, because the interaction $ is completely arbitrary at 
the "useless points" w G il \ _F. 
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2.3.6 Structure of the Space Q{Il) 

Physical systems exhibit in general one or more possible "macrostates"^^, depending 
on the values of some control parameters. For instance, water can be in a liquid, 
solid or gaseous "macrostate" depending on temperature and pressure; and there are 
points on the temperature-pressure phase diagram where two or even all three of these 
"macrostates" are possible. 

The physical relevance of the theory developed in the preceding subsections relies 
on the assumption that for each physical system there exists a specification 11 from 
which all the statistical-mechanical information about the system can be obtained: 
that is, such that the space Q{Ii) of measures consistent with 11 describes all the 
"macrostates" of the physical system that are possible for the given choice of control 
parameters. Therefore, we must be able to transcribe all the expected properties of the 
set of these "macrostates" in terms of properties of the space ^(H). We briefiy discuss 
here this transcription. In consistency with our main message that not everything in 
the world is Gibbsian, everything in this subsection holds for general specifications^ 
which need not be Gibbsian. (This generality will also be useful when discussing 
statistical mechanics at zero temperature: see Appendix B.2.1.) However, for the sake 
of brevity and familiarity, we will sometimes refer to the measures consistent with 11 
as the "Gibbs measures" for 11 — which is a slight abuse of language when 11 is not 
Gibbsian. 

There are two important properties that characterize the macroscopic systems ob- 
served in nature. Firstly, these systems involve a huge number of degrees of freedom, 
so large that only a statistical description is possible. However, these statistical aspects 
do not manifest themselves at a macroscopic level: that is, macroscopic observables do 
not fluctuate; the system behaves deterministically with respect to them. The second 
property refers to the microscopic observables: they do fiuctuate, but their fiuctua- 
tions are only local, not affecting large regions. Fquivalently, local observations made 
far away one from, the other are almost independent . 

To translate these properties into precise mathematical statements, we need first to 
specify what a macroscopic observable is. As is usual with long-used concepts, there is 
more than one possible meaning. Some people consider a macroscopic observable to be 
any translation-invariant measurable function. At this point, however, we would like 
to remain at a general level, leaving the aspects related to translation-invariance until 
the next section. So we adopt an alternative definition, which corresponds to what 
could be called "global" observables, namely observables that do not depend on what 
happens to finitely many spins. Recall that if A is a finite subset of the lattice, then 
J^f^c is the cr-field consisting of all events that are measurable by observations made 



^^These "macrostates" are also referred to as "phases" in the chemical and physical literature. 
Here, following an established mathematical-physics nomenclature, we reserve the word "phase" for 
the notion of "pure phase" , to be defined in Section 2.4.9 below. For this informal discussion we prefer 
to use the word "macrostate" , but keeping the quotation marks to emphasize the informality of the 
concept. We do not want to get entangled with the many different senses adopted in the literature 
for the word "state" . 
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solely outside A; that is, they are the events that do not depend on the behavior of the 
spins inside A. Now consider the events that belong to J-'^c for every finite subset A: 
these events constitute a cr-field 

^oo = n -^A^ ' (2-31) 

Aes 

which consists of all those events whose definition is not affected by changes on any 
finite number of spins. This field is usually called in mathematics the tail field, and 
could be thought as the field of global events. The functions measurable with respect to 
this field are called ohservahles at infinity and can be interpreted as global ohservahles . 

Examples of global observables. f. All "macroscopic averages", for instance 
observables of the form 

{lim |A„|~"^ X] fi'^x) if the limit exists 
.eAn (2.32) 

(or whatever) otherwise 

where ( A„) is a suitable increasing sequence of finite subsets of C which together exhaust 
C (we will discuss this further in Section 2.4.f ), and /: Oq — ^ R is a measurable function. 
A macroscopic average as in (2.32) is obviously unaffected by altering finitely many 
spins, so / is indeed an observable at infinity. 
2. In an Ising model, consider 



9{^) 



' 1 if there exists an infinite connected cluster 

of + spins (2.33) 

. otherwise 



The existence of an infinite cluster is obviously unaffected by altering finitely many 
spins, so g is indeed an observable at infinity. (This observable is of particular impor- 
tance in percolation theory.) 

3. In an Ising model, consider the difference in magnetization between the even 
and odd sublattices: 

I lim |A„|~"^ J2 ( — l)'"^'^!; if the limit exists 

Mstagg = I ""^^ -^An (2.34) 

I otherwise 

where (A„) is as before. This also is obviously an observable at infinity. 

Thus, the usual macroscopic measurements performed on real systems correspond 
to global observables, but the converse is not true: as Example 3 illustrates, our con- 
cept of "global observables" includes some quantities that are experimentally not very 
accessible. For example, in the antiferromagnetic Ising model, the sign of the staggered 
magnetization is an observable at infinity, which detects which of the two sublattices 
is positively magnetized and which is negatively magnetized. But it is very unlikely 
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that an experimenter could succeed in reliably labelling the two sublattices, much less 
in measuring separately their magnetizations. 

After the previous discussion, we can now state more precisely which properties a 
measure fi representing a "macrostate" of a physical system must have: (i) It must 
be deterministic on global events, that is fi{A) can only take the values or 1 for 
an event A G J-'oo] and (ii) its expectation for spatially distant events must, in some 
sense, asymptotically factorize (= short-range correlations = (some type of) cluster 
property). It turns out that these two properties are equivalent: 

Proposition 2.19 Let fi G Af_|_i(0). Then the following properties are equivalent: 

(a) fi has trivial tail field, that is, if A ^ Too then /u(A) equals either or 1. 

(h) fi has short-range correlations, that is, for each A (^ T we have 

lim sup |/i(An5) -/i(A)/i(5)| = 0. (2.35) 

AT/: BdTiyc 

Aes 

Property (a) states, roughly speaking, that all the observables at infinity (= global 
observables) take a constant value from the point of view of the measure //. For 
instance, the fact that all the sets of the form {io G 0: f{io) G B} have measure either 
or 1 means that there is a precise value /^ such that f = f^ with //-probability 
1. Property (b) is a strong "cluster property": it states that distant regions of the 
lattice are asymptotically independent (even if one of the regions involves infinitely 
many spins), uniformly in the observable measured in the second region. 

Now fix a specification II, and let us consider the structure of the set ^(H). We 
know that ^(11) is a convex set, so it is natural to ask what are its extreme points. ^^ 
The answer is: 

Proposition 2.20 Let fi G ^(H). Then the following properties are equivalent: 

(a) fi is an extreme point ofQ(Y[). 

(h) fi has trivial tail field. 

(c) fi has short-range correlations. 

The upshot of the preceding discussion is that the "macrostates" of a physical 
system described by a certain specification correspond to the extremal Gihhs mea.sures 
for this specification. What is the interpretation of the non-extremal measures of ^(11)? 
For "nice" convex sets, every point in the set can be represented as the barycenter of 
a probability measure concentrated on the extreme points (this is a kind of "integral" 



^®We recall that the extreme points of a convex set are those that cannot be written as a non-trivial 
convex combination of other points in the set. 
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convex combination). It turns out that Q{Il) is nice in this sense. ^^ Thus, every non- 
extremal measure in Q{Ii) is an (integral) convex combination of extremal ones. In fact, 
a deep result ([299, Theorem 2.2], [157, Theorem 7.26]) states that this decomposition is 
unique^ that is, that ^(11) is a simplex. These results mean, in experimental terms, that 
a non-extremal Gibbs measure corresponds simply to the preparation of a randomly 
chosen extremal Gibbs measure. The probabilities for this choice are given by the 
"coefficients" of the convex combination. This extra randomness can be interpreted as 
representing ignorance on the part of the experimenter about the system's "macrostate" 
(i.e. over and above his/her unavoidable ignorance about its microstate). From this 
point of view, the physical system itself can always be considered to be in a well- 
defined "macrostate" described by an extremal Gibbs measure. Thus, the extremal 
Gibbs measures are the "pure" physical objects. 

As a consequence of the preceding discussion, we conclude that the cardinality of 
the set of extremal measures of ^(11) represents the number of physical "macrostates" 
available to the system. A change in this number as the control parameters are varied 
corresponds to a phase transition (more precisely, to one of the notions of phase transi- 
tion, see Section 2.6.5); and the variation of this number as a function of these control 
parameters (temperature, magnetic field, chemical potential, etc.) can be recorded in 
the form of a phase diagram.. Therefore, the study of the set of extremal measures 
of ^(n) is a central problem in statistical mechanics. As a first step, it is essential 
to determine conditions under which the set ^(11) is nonempty , i.e. under which there 
exists at least one infinite- volume Gibbs measure. Contrary to what one might initially 
think, this is a non-trivial problem, since there exist physically quite reasonable models 
for which there are no infinite- volume Gibbs measures. The typical examples are the 
short-range massless Gaussian models (harmonic crystals) in dimension d < 2^ and the 
solid-on-solid or the discrete Gaussian models in d = 1. The essential point here is 
that the existence of Gibbs measures in these models is equivalent to the breaking of 
a non-compact symmetry of the single-spin space (the shift of all the spin values by a 
constant); and, as is well known, it is impossible to break discrete symmetries in d = 1 
or continuous symmetries in d < 2. We refer to [157, Chapter 9] for precise state- 
ments, references and further examples. In any case, the following theorem suffices for 
virtually all applications to models of bounded spins: 

Proposition 2.21 Let ft be a compact metric space, and let 11 = (7rA)Ae5 be a Feller 
specification. Then Q{Tl) is nonempty. 

This result is, in fact, an immediate consequence of Proposition 2.22 below: take 
any sequence whatsoever of boundary conditions {vn)] by compactness, the sequence 
{^riT^kn) must have at least one limit point //, and Proposition 2.22 then guarantees 
that ji G ^(n). 



^^For systems of bounded spins this can be proven by appealing to the Choquet theorem [293, 206]. 
For general systems it can be proven through direct probabilistic arguments [299, pp. 24-32] [157, 
Section 7.3 and the associated notes] [107]. 

39 



If there are several "macrostates" available to the system, and an experimenter 
wants to select a particular one with absolute certainty, how must he/she proceed? 
There are basically two ways: One approach is to add to the Hamiltonian some addi- 
tional fields, such that an infinitesimal value of these fields — more precisely, a limit 
process consisting in turning them on and then slowly off in some appropriate sequence 
— selects one or the other of the "macrostates". For example, in an Ising model at 
low temperature, one may add to the Hamiltonian a magnetic field h; the limits h I 
and /i I then select the extremal Gibbs measures /i_|_ and //_ of the zero-field Ising 
model. An alternative approach is to immerse the (finite) sample in a configuration 
typical of the intended "macrostate" (selection via boundary conditions). For exam- 
ple, in the Ising case we could use boundary conditions in which the spins outside the 
volume A are fixed to be all -|- or all — ; taking the limit A | £ with these boundary 
conditions again selects /i_|_ or //_, respectively. In relation with this second point of 
view we present two propositions, the first of which justifies a posteriori the traditional 
approach to infinite-volume lattice systems based on infinite-volume limits: 

Proposition 2.22 Let 11 = (7rA)Ae5 be a Feller specification. Let (A„)„>i be an in- 
creasing sequence of finite volumes whose union is C, and let {i'n)n>i be an arbitrary 
sequence of probability measures on ft (i.e. arbitrary deterministic or random boundary 
conditions). Let iJ. be any limit point (in the weak topology) of the sequence {i'n'^A„)n>i- 
Then fi is consistent with 11. In particular, Q{Ii) is a closed subset o/Af_|_i(0). 

Proposition 2.23 Let ft be a compact metric .space, let 11 = (7rA)Ae5 be a Feller 
.specification, and let fi be an extreme point ofQ(Y[). Then, for fi-a.e. u, 

lim S^-Kf^ = fi (2.36) 

A t £ 

A es 
in the weak topology. 

Proposition 2.22 states that any weak limit of finite-volume Gibbs measures, with 
arbitrary deterministic or random boundary conditions, is an infinite-volume Gibbs 
measure. This is the link between the DLR approach and the traditional approach 
via limits of correlations. Proposition 2.23 is a very strong converse statement, for 
the special case of extremal Gibbs measures: it states that if one takes any "typical" 
configuration from the measure // and uses it as a boundary condition, in the infinite- 
volume limit one recovers //. This is the mathematical transcription of the process 
of selecting a "macrostate" by preparing the sample with an appropriate boundary 
condition. In fact, there is a revealing generalization of this, that states that if // is 
any Gibbs measure, then if one takes a "typical" configuration from the measure // 
and uses it as a boundary condition, in the infinite-volume limit one recovers one of 
the extremal Gibbs measures in the decomposition of // [156]. This theorem can be 
interpreted as saying that the result of a measurement on a large (strictly speaking 
infinite) system will always yield a value characteristic of one of the extremal Gibbs 
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measures: for example, a measurement of the magnetization in a low-temperature Ising 
model at zero magnetic field will always yield ibAfo, not an intermediate value. 

Finally, the consistency between the physical picture and the mathematical formal- 
ism requires some discussion of the issue of distinguishability of "macrostates". Phys- 
ically, two "macrostates" should be considered different only if there is some macro- 
scopic measurement that can tell the difference. In terms of the formalism discussed so 
far, this corresponds to the requirement that global observables be able to distinguish 
among the different extremal measures for a given specification. The following theo- 
rem shows that even more is true: the global observables uniquely characterize each 
measure — extremal or not — consistent with a given specification. 

Theorem 2.24 Let 11 be an specification. Then: 

(a) The extremal measures of Qijl) are mutually singular when restricted to the tail 
field. That is, if fi and v are distinct extremal mea.sures of Qijl), there exists a 
set A G J^oo such that /u(A) = I and v{A) = 0. 

(b) Each mea.sure fi G ^(H) is uniquely determined [among the mea.sures of Qijl)] by 
the events in the tail field. That is, if fi and v are mea.sures in Q{Tl) .such that 
/i(A) = v{A) for each A G J^oo, then fi = p. 

For the proof, see [157, Theorem 7.7]. 

2.3.7 Conditioning on an Arbitrary Subset of Spins 

The DLR equations tell us how to condition on the spins in the complement of a finite 
set. However, in Section 4 we shall need to condition on sets of spins which are not 
complements of finite sets. Therefore, we need the following technical construction, 
which can be skipped on a first reading. 

Let n = (7rA)Ae5 be a specification. Let A be a subset of C [not necessarily co- 
finite!). Let cu G be a configuration (but only its components cua will play any role). 
We then define the system, restricted to the volume £ \ A, with configuration space 
(Oo)'^^^: the specification for volume jC\ A with external spins set to uj^ is the family 
n"" = (Oag^.acAA defined by 



TT 



:{u',A) = TrA{uAXu',A) (2.37) 



where uj' G (flo)^^^ and A G J^c\/i- Clearly the functions 7r^(-,A) are J^(c\/i)\k- 
measurable. It is easy to see that the family 11'^ defines a specification on the system 
with lattice £ \ A. 

Let now // be a measure consistent with 11. Let fi.^ be a regular conditional proba- 
bility for /i given JF^. (Such regular conditional probabilities always exist if (0,JF) is, 
for example, a standard Borel space. This includes all examples of physical interest.) 
We then have the following intuitively obvious result: 

Proposition 2.25 For fi-a.e. u, the mea.sure fi^\J-'c\A is consistent with 11^. 
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2.4 Translation Invariance 
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Until now the lattice C has been simply a countably infinite set of sites, devoid of 
any geometric structure. In most applications, however, £ is a regular J-dimensional 
lattice; this additional structure allows us to define the notion of translation invariance 
for measures, interactions, specifications and so forth. For simplicity we shall take C to 
be the simple (hyper)cubic lattice Z . This is no real loss of generality, because other 
regular lattices can be mapped to Z by an appropriate labelling of sites. ^^ 

2.4.1 Van Hove Convergence 

An important role in the statistical mechanics of translation-invariant systems is played 
by sequences of volumes (A„) which grow in such a way that the surface-to- volume ratio 
tends to zero. We therefore make the following definitions: 

Definition 2.26 Let r > 0, and let A C I"^ . We then define 

• the inner r-boundary d~ A = {x G A: dist(a;,A^) < r} 

• the outer r-boundary d^A = {x G A^: dist(a;,A) < r} 

• the r-boundary drA = d~ A U d^A 

We can then state the desired condition in a number of equivalent ways: 

Proposition 2.27 Let (A„)„>i be a sequence of nonempty finite subsets of Z . Then 
the following are equivalent: 

(a) lim |arA„|/|A„| = 0. 

(b) lim |a+A-^l/|A.| = 0. 

(c) For each r > 0. hm |9rA„|/|A„| = 0. 

(d) For each a G T^ , lim |A„ \ (A„ + a)|/|A„| = 0. 

(e) For each a G 1'^ , lim |(A„ + a)\ A„|/|A„| = 0. 

(f) For each finite subset A C T^ , lim |A„A(A„ + A)|/|A„| = 0. 



ri^oo 



^^References for this section are Georgii [157, Chapter 14], Israel [206, Chapter IV], Preston [299, 
Chapter 4] and Ruelle [313, Chapter 3]. 

^^What is really relevant here is not that C equals Z , but merely that the additive group Z acts on 
£,: that is, there should exist bijections ta'- £ ^ £ (a G Z ) such that tati = ta-\-i and to = identity. 
The formulae below can easily be generalized to this case, by replacing each occurrence of a; — a by 

ta{x). 
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Moreover, all of these conditions imply that: 

(a) lim |A„| = oo. 

([3) There exist vectors a^ ^ ~L such that the translates A„ — a„ fill out Z in the 
following sense: for each finite subset A G Z , there exists no{A) < oo such that 
A C An — ttn for all n > no (A). 

Definition 2.28 A sequence (A„)„>i of nonempty finite subsets of Z is said to con- 
verge to infinity in the sense of van Hove (denoted A„ y ooj if it satisfies any one 
(hence all) of the equivalent conditions of Proposition 2.27. 

Definition 2.29 Let F be a function from S (the nonempty finite subsets of Z ) to 
some metric .space W, and let w be some element of W . We write iim -F(A) = vo in 

A/'oo 

case iim F(A„) = vo for every sequence (A„) that tends to infinity in the sense of van 
Hove. 

2.4.2 Translation-Invariant Measures 

Witli tliese preparations, we now focus attention specifically on translation invariance 
in lattice spin systems. With £ = Z , the translation group Z acts on the infinite- 

volume configuration space = (Oq) by 



{Taio)^^ = ujx-a for all x G >C (2.38) 

where a G Z . (The minus sign here means that T^OJ is the configuration uj translated 
forward by a.) This action on the configuration space induces in turn an action on 
functions 

(TJ){uj) = f{TaUj) for all UJ eft (2.39) 

and on measures 

{Tali){A) = ii{T-^[A]) forallAG.F. (2.40) 

A function / G B{fl) is said to be translation-invariant if Taf = f for all a G Z . A 
measure // is said to be translation-invariant if Ta/i = // for all a G Z . We denote by 
Minvift) and Af_|_i^m^,(0) the spaces of translation-invariant measures. All this is just a 
precise mathematical statement of the obvious notions that everybody has in mind. 

At this point we can repeat the considerations done in Section 2.3.6, this time re- 
garding the relationship between physical "macrostates" and elements of Af_|_i^m^,(0). 
If we take the point of view that the "macroscopic" observables are the translation- 
invariant bounded measurable functions, then the requirements for a measure fi repre- 
senting a physical "macrostate" are: (i) Translation-invariant functions in B[ft) must 
not have fiuctuations with respect to //, i.e. they must be constant with //-probability 
one; and (ii) the probability of far-away events must factorize in some sense. Once more, 
the extremal measures are the objects with the right properties. Indeed, Af_|_i^m^,(0) is 
a convex set, and its extreme points are characterized by the following theorem: 
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Proposition 2.30 Let fi G Af_|_i^m^,(0). Then the following properties are equivalent: 
(a) fi is an extreme point of M^i^inv{^)- 
(h) Every translation-invariant function f G B[ft) is fi-a.e. constant. 

(c) limn^oc n-"^ J2aeCn Kf Tag) = Kf)K9) for all f,g e 5(0) [or Bgi{n) or 
Bioc{^) or C{ft)], where Cn is a cube of side n. 

(d) liniA/oo lAI-^EaeA/wl/raS') = KDKg) for all f,g G 5(0) [or Bgi{n) or 
Bun) or C(0)7. 

We notice that the "cluster property" embodied by properties (c) and (d) is much 
weaker than the one presented in Section 2.3.6 [part (b) of Proposition 2.19]: (c) 
and (d) state that distant regions of the lattice are asymptotically independent, but 
only in an averaged sense. A measure fi G Af_|_i^m^,(0) having the properties listed in 
Proposition 2.30 is said to be ergodic. 

Therefore, by considerations analogous to those of Section 2.3.6, if we consider the 
translation-invariant functions to be the only "macroscopic" observables, then the er- 
godic measures are associated to physical "macrostates" and their convex combinations 
to "mixtures" representing ignorance on the part of the experimenter. Note that, as 
in the first part of Section 2.3.6 (through Proposition 2.19), we have not made any 
reference to interactions, specifications or Gibbsianness; the present comments have 
general validity. 

We have now introduced two distinct classes of observables that could plausibly 
be called "macroscopic": the global observables (Section 2.3.6) and the translation- 
invariant observables (present section). Which class truly corresponds to the "experi- 
mentally accessible" observables? This question does not have a canonical answer: it 
all depends on the system and the experiments. It is known [157, Proposition 14.9] that 
for a translation-invariant measure //, every translation-invariant function is measur- 
able at infinity, modulo a set of //-measure zero. The converse is not true. By limiting 
ourselves to translation-invariant observables, we eliminate some not-very-accessible 
global observables, like the staggered magnetization mentioned in Section 2.3.6. 

Analogous questions could be posed in relation to whether the extremal measures of 
^(n) or the extremal measures of Af_|_i^m^,(0) should represent physical "macrostates". 
We shall comment briefiy on this point once we define the notion of translation-invariant 
specifications (Section 2.4.9). For now, let us comment that the ergodic measures have 
the additional appeal of being precisely those for which "space averages equal ensemble 
averages" : 

Proposition 2.31 (Ergodic theorem) Let iJ. be an ergodic translation-invariant prob- 
ability measure on 0, and let f G L^{ji). Then: 

(a) lim |A|~^ Xl Taf = Jf dfi in L^[fi) norm.. 
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(b) lim n '^ J2 Taf = { f da pointwise n-a.e. 

Part (a) is called the L^ (or mean) ergodic theorem; it is easily generalized to L^ for all 
p < oo. Part (b), which is much deeper, is called the BirkhofF (or individual) ergodic 
theorem. 

The simplex Af_|_i^m^,(0) of translation-invariant measures has the property that its 
extremal elements — namely, the ergodic measures — are dense in the whole set, in 
the bounded quasilocal or weak quasilocal topology. In other words, any translation- 
invariant measure fi can be approximated arbitrarily closely, with regard to any finite 
set of (quasi)local observables, by ergodic measures. Physically this means that through 
observations in any finite volume, no matter how large, one cannot learn the long-range 
correlation properties of the measure // (ergodicity or the lack thereof). The proof of 
this fact is really quite simple: Pave Z by cubes of side n; let //„ be equal to ji on 
each cube, but independent between cubes (i.e. cut the correlations between distinct 
cubes); and finally, let /!„ be //„ averaged over the n'^ possible translates (so as to 
make it translation-invariant). Then it is easy to see that /!„ is ergodic, and that 
lim^^oo /Wra = /W in the bounded quasilocal topology. We have just sketched the proof 
of: 

Proposition 2.32 The ergodic measures are a dense subset of Mj^i^inyijt) , in the 
bounded quasilocal topology [and hence also in the weak quasilocal topology]. 

The density of the ergodic measures is thus an intrinsic and natural feature of 
infinite-volume physics. Geometrically, however, a simplex with dense extreme points 
(a so-called Poulsen simplex) is highly unintuitive. Indeed, our usual intuition, derived 
from finite-dimensional geometry, is that the extreme points should form a closed subset 
(as e.g. the vertices of a triangle, of a tetrahedron, etc.). The unusual behavior of 
M+i^invi^) is possible only in infinite dimensions. It will be at the origin of many of 
the "pathologies" to be discussed in Section 2.6.7. 

Remark. It is an amazing mathematical fact that a (compact metrizable) sim- 
plex with dense extreme points is essentially unique: all Poulsen simplices are affinely 
homeomorphic to each other [252, 284]. 

If we think of the ergodic measures as representing all the "macrostates" available 
to the system, it is natural to inquire whether the translation-invariant observables 
distinguish between different such measures, as is desirable on physical grounds (see 
the analogous discussion at the end of Section 2.3.6). The answer is yes: 



Theorem 2.33 (a) The extremal measures of M^i^inv{^) (i-e. the ergodic measures) 
are mutually singular when restricted to the a-field Tinv of translation-invariant 
events. That is, if fi and v are distinct ergodic mea.sures, there exists a set 
A G J^inv such that /u(A) = 1 and v{A) = 0. 
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(b) Each measure fi G Af_|_i^m^,(0) is uniquely determined [among the measures of 
M+i^inv{^)J by the translation-invariant events. That is, fi and v are measures 
in Mj^i^inyijt) such that /u(A) = v{A) for each A G J^inv, then fi = p. 

For the proof, see [157, Theorem 14.5]. In fact, this theorem is also true with the 
invariant field J-'inv replaced everywhere by the tail field J-'oo] this follows from [157, 
Proposition 14.9]. 



2.4.3 Dividing Out Translation Invariance 

Translation invariance brings along some natural notions of "equivalence". For in- 
stance, different observables cannot always be distinguished when looked at in a trans- 
lation-invariant measure. (Example: ctq versus crn.) In this section we discuss the 
central object generating all these notions of "equivalence", namely the set of functions 
that have zero average with respect to all translation-invariant measures. 

From now on until the end of Section 2, we shall generally assume that the single- 
spin space Oo is a compact metric space, i.e. we restrict attention to models of bounded 
spins. The configuration space is then also compact. This restriction is made 
primarily to simplify the exposition; in Appendix A we partially remove this restriction. 

The functions of interest here are characterized by the following proposition: 

Proposition 2.34 Let Oq be a compact metric .space, and let f G C(0). Then the 
following properties are equivalent: 

(a) f has zero mean with respect to every tran.slation-invariant probability mea.sure, 
i.e. Jf dfi = for all fi G Af_|_i^m^,(0). 

(b) f has zero mean with respect to every translation-invariant finite signed mea.sure, 
i.e. Jf dfi = for all fi G Minv{^)- 

(c) f lies in the closed linear .span of the family of functions {g — Tag: g G C(0), a G 



(d) lim n~ 



fe) lim lAI 

* ^ A/oo 



E TJ 

aeC'n 



aeA 



0. 



0. 



We denote by X the class of functions having the properties specified in the foregoing 
proposition; it is a closed linear subspace of C(0), and is exactly the annihilator of 
Minv{^)- The space X will play a very important role in the theory of translation- 
invariant equilibrium measures, and in particular in the discussion of "physical equiv- 
alence". We define the quotient (semi)norms: 

lloo = i(sup/-inf/) (2.41) 



C{n)/const 



c(n)/i 



C(n)/(I+const) 



ltJf-4o 

inf||/-5||c 

g£l-\-const 



(2.42) 
(2.43) 
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The quotient (semi)norms in C{ft)/X and C (ft) / [X -\- const) are given by simple explicit 
formulae: 

Proposition 2.35 Let f G C(0). Then: 
(a) lim |A|~"^ Y, Taf exists and equals ||/||c(fi)/J- 



A/'oo 



aeA 



(h) lim |A| ^ E Taf exists and equals \\f\\c(n)l(i+const)- 



A/'oo 



aeA 



C(n)/const 

2.4.4 Spaces of Translation-Invariant Interactions 

With C = 1. , it also makes sense to discuss translation-invariance of interactions: 
Definition 2.36 An interaction $ = ($a) is said to he translation-invariant if 

^A+x = T^'^A for all AeS, x e I^ . (2.44) 

For example, the Ising interaction (2.12) is translation-invariant iff J^y = J{x — y) 
and hx = h = constant. 

We now introduce some important Banach spaces of interactions: 

Definition 2.37 For each a > 0, we denote by B" the space of translation-invariant 
continuous interactions with norm 

||$||b. = El^l""'ll^x||oo < ^. (2.45) 

More generally, for any translation-invariant function h: S ^ [l,oo), we let Bh be the 
space of translation-invariant continuous interactions with norm 

Mb, = Et^II^^IU < ^- (2-46) 

The most important of these spaces are S° ("Israel's big Banach space") and B^ ("Is- 
rael's small Banach space"). Indeed, S° is naturally related to C(0) [see Proposition 
2.40 below], and so will be the natural space on which to develop the theory of equilib- 
rium measures (Section 2.6); while B^ is the space of translation-invariant absolutely 
summable continuous interactions (see Definition 2.3), and so is a natural space for the 
theory of Gibbs measures. Note that our assumption /i > 1 implies that II^Hb^ ^ II^IIb" 
and hence Bh C S°; so S° is the largest space of interactions that we shall consider. 

Let us also introduce the space Bfinite consisting of all finite-range translation- 
invariant continuous interactions. Bfinite is a dense linear subspace of each of the Banach 
spaces Bh- It will sometimes be convenient to carry out proofs first for some class of 
"nice" interactions — e.g. finite-range ones — and then extend to more general inter- 
actions by a density argument. 
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Remark. The hypothesis of continuity of the interaction plays a role in some but 
not all of the theorems below (the mathematically inclined reader is invited to figure 
out which ones). To avoid complicating the notation, we have included continuity as 
part of the definition of the spaces B"^ Bh and Bjinite- 

We emphasize that all the spaces B" permit two-body (or more generally n-body) 
interactions of arbitrarily long range ^ provided only that they are absolutely summable. 
Indeed, for a pure n-body interaction $, the norms || • \\isa are all equivalent: we have 
||$||b« = ^"II^IIb"- The difference between the spaces B" is that lower values of a 
permit interactions which contain heavier contributions from large n, i.e. which are 
"more strongly many-body". If we want to force $ to be "short-range", we must take 
h[X) to grow to -|-oo as the diameter of X (and not just its cardinality) tends to 
infinity [199, 294]: 

Definition 2.38 We write h^lif, for each K < oo, there exists R = R{K) < oo 
such that h[X) > K whenever diam(X) > R. [Equivalently, for each K < oo, there 
are only finitely many X (modulo translation) such that h[X) < K .] In this case we 
say that Bh is a space o/ short-range interactions. 

The following proposition will be useful in Sections 3.3 and 5.1.2: 

Proposition 2.39 Fix a translation-invariant weight function h: S ^ [I,oo), and fix 
M <oo. Then: 

(a) The hall {$: II^Hb^ < M] is a closed subset of B° . 

(h) If h ^ 1 and the single-spin space Oq is finite, then the ball {$: II^Hb^ ^ M} is 
a compact subset of B^ . 

Remark. Since we are here using the sup norm ||$a||oo to measure the strength of 
an interaction, all of the above spaces consist solely of bounded interactions. This is fine 
for systems of bounded spins, but these spaces are not adequate for treating physically 
interesting systems of unbounded spins (Gaussian model, (^"^ model, SOS model, etc.). 
It is an open problem to devise a physically reasonable and adequately comprehensive 
space of interactions for unbounded-spin systems. We remark that any such space is 
unlikely to be a linear space, because it is perfectly possible for an interaction $ to 
be reasonable while — $ is unstable. Nor can it be a convex cone, because $ may be 
reasonable while A$ is unstable for A large and positive. However, such a space could 
conceivably be a convex subset of an appropriate linear space. 

2.4.5 The Observable /$ Corresponding to an Interaction $ 

Let $ be an interaction in B^ . Then it is useful to define an observable (= function) 
/$ that corresponds roughly to "the contribution to the energy from the neighborhood 
of the origin" : 

/$ = Y.\X\~'^x. (2.47) 
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It is obvious from the definition of S° that this sum is convergent in || • ||oo norm, and 
that ||/$||oo < II^IIb"- Note also that /$ is a quasilocal function, i.e. / G Cg;(0). 
This definition of /$ is not unique: one could equally well use instead 

/4 = Y. ^x (2.48) 

where X 3min denotes that is the smallest element of X in lexicographic order, 
or many other definitions [313, Section 3.2]. The important point is that all such 
definitions give the same value for the mean of /$ with respect to any translation- 
invariant measure (that is, they give the same "mean energy per site"); in other words, 
any two such definitions of /$ differ by an element of the space X defined in Proposition 
2.34. Therefore, what is defined naturally is not the map $ i— )• /$ of S° into C(0), but 
rather the map $ i— )• [/$] of S° into the quotient space C(0)/X. We can then define 
the following subspaces of S° 



Const = {$ 

J = {$ 

J + Const = {$ 

and the corresponding quotient (semi)norms 



/$ = constant} (2.49) 

U e ^} (2.50) 

U el + const} (2.51) 



\MBO/Const = inf ||$-*||bo (2.52) 

||$||ko/7 = inf 11$ - vplLo (2.53) 

\\^\\By(j+ Const) = inf ||$-*||bo (2.54) 

It is then not difficult to verify that: 

Proposition 2.40 Let Oq be a compact metric space. Then the map [$] i— )• [/$] is an 
isometry ofB^/J onto C(0)/J, and ofB°/{J + Const) onto C{n)/{I + const). 

2.4.6 Physical Equivalence in the Ruelle Sense 

The discussion in the preceding section motivates the following definition: 

Definition 2.41 Let $,$' G B^ . We say that $ and $' are physically equivalent in 
the Ruelle sense (f $ — $' G JT" + Const, i.e. if /$ — /$/ G X + const. 

Ruelle [313] was the first, to our knowledge, to highlight the central role played by the 
subspace X in the variational theory (see also [199, 209]). 

We have now defined two distinct notions of "physical equivalence" for interactions: 

• The DLR sense (Section 2.3.5), which is defined for arbitrary convergent (but not 
necessarily translation-invariant) interactions, and which guarantees the equality 
of the specifications (Theorem 2.17). 
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• The Ruelle sense, which is defined for arbitrary translation-invariant (but not nec- 
essarily absolutely summable or even convergent) interactions in S°, and which 
guarantees the equality of the family of equilibrium measures (Proposition 2.65 
below). 

It is natural to ask, therefore, whether these two notions are equivalent on their common 
domain of definition. The answer, fortunately, is yes: 

Theorem 2.42 Let the single-spin space Oq be a complete separable metric space, and 
let $,$' be interactions in B^ . Then $ and $' are physically equivalent in the DLR 
sense if and only if they are physically equivalent in the Ruelle sense. 



In Sections 3.3 and 5.1.2 we will need a version of Proposition 2.39 "modulo physical 
equivalence". Unfortunately, we have not been able to prove such a result for B^ (or 
any space B")^ and we do not know whether it it true. All we have is a result for spaces 
Bh of short-range interactions: 

Proposition 2.43 If h^ 1 and the single-spin space Oq is finite, then for each M < 
oo the sets {$: ||$||Bh/j" ^ M} and {$: ||$||Bh/(j"-i-Corast) ^ M} are closed subsets of 
B^. 

2.4.7 Estimates on Hamiltonians: Bulk versus Surface Effects 

We can now prove some estimates on the finite- volume Hamiltonians, which will play 
a key role both in the variational theory (Section 2.6) and in our applications to the 
renormalization group (Sections 3.2 and 3.3). The main physical idea in these estimates 
is to distinguish between "bulk" effects [namely, those which are of order |A|] and 
"surface" effects [those which are o(|A|)]. The upshot is that, provided one can control 
the surface contributions, many natural quantities are equivalent "in the bulk": this 
includes the Hamiltonians Hf j^^^ and Hf^, as well as the "Hamiltonian-like objects" 
Y.xdhT^J'^ and - log d fi a /dfi^. 

For free boundary conditions, it suffices to take $ in the "big" Banach space S°: 

Proposition 2.44 Let $ G B°. Then 

(a) 

iK/.eelloo < IMIMbO. (2.55) 

(b) 

iK/.eelloo = |A|||$||^o/^ + o(|A|) 

= |A|||M|c(Q)/j + o(|A|) (2.56) 

as A y oo (van Hove). 
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(c) 

II^A,/reel|c(fi)/const = |A| ||$||BO/(:^+Const) + o(|A|) 

= |A|||/$||c(fi)/(j+con.t) + o(|A|) (2.57) 



as A. y oo 


(van Hove). 






(d) 












-"AJree 


- > ; Tj^ 



< o(\A\) (2.58) 

no 

xeA 
as A y oo (van Hove). 

Note, in particular, part (d) of this proposition: since /$ is (roughly) "the contribution 
to the energy from the neighborhood of the origin", it follows that Y^xeA^xf^ ought 
to be (roughly) "the contribution to the energy from the volume A". And indeed it is: 
while this sum does not exactly equal Hf j^^^, it differs from it only by a "surface" term. 
In this sense, Y^xeA Txf^ can be thought of as yet another Hamiltonian for volume A, 
corresponding to some new type of "boundary condition" . 

In order to control the Hamiltonians with general external boundary conditions, it 
is necessary to take $ to lie in the "small" Banach space B^: 

Proposition 2.45 Let $ G S^ Then: 
(a) $ is absolutely summable, and 



(b) 



||i:^|||oo < |A|||$||bi . (2.59) 

I^aIIoo = |A|||$||,5o/:, + o(|A|) 

= |A|||M|c(Q)/j + o(|A|) (2.60) 



as A y oo (van Hove). 
(c) 

\\Hf^\\c{Q.)lconst = \M\\^\\B°/{J+Const) +o{\A\) 



\M\\U\\c(a)/{i+const) + o{\A\) (2.61) 



as A y oo (van Hove). 
(d) 



WWl^.Wco = \\Ht-Htj^J\^ = snp\\Hl^-Htj^J\^ < o{\A\) (2.62a) 

\\Ht-J2TxU\\oo < snp\\Hl^-J2TJ^\\^ < o(|A|) (2.62b) 
xeA ^£^ xeA 

as A y oo (van Hove). 

In summary, $ G S° suffices to control the Hamiltonian with free boundary condi- 
tions, but $ G S^ is needed in order to control the Hamiltonian with external boundary 
conditions and hence to apply the theory of specifications and Gibbs measures. 
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2.4.8 How to Obtain an Interaction from a Gibbs Measure 

If /i is a Gibbs measure for an interaction $ G S^, then the DLR equations permit us 
to read off the interaction $, modulo physical equivalence, from the measure //: 

Proposition 2.46 

(a) Let fi be a Gibbs measure (not necessarily translation-invariant) for an interaction 
^ eB\ Then 



i°g :7^ - E T.U 



C{n)/const 



< o(\A\ 



as A /* oo (van Hove). In fact, this bound is uniform for ji G ^(11*). 



(2.63) 



(b) Let /ii,/i2 be Gibbs measures (not necessarily translation-invariant) for interac- 
tions $i,$2 £ B^ , respectively. Then 



log 



J/i 



lA 



dji 



2A 



log 



J/i 



lA 



J/i 



2A 



C(U)lconst 



< 2\K\\\^r-^2\Wi{j+Const) + o(|A| 

= \^\\\^l-^2\Wl{J+Const) + o(|A|) 



(2.64) 



(2.65) 



as K /^ oo (van Hove). In fact, this bound is uniform for jii G ^(n^^) and 

/i2Ge^(n^^). 

Part (a) of this proposition tells us that the interaction can be reconstructed by 
taking the logarithm of the finite-volume densities. This corresponds to the fact that 
Boltzmann factors are exponentials of Hamiltonians. An immediate consequence of 
this is part (b). One implication of part (b) is that the reconstructed interaction is 
unique modulo physical equivalence (Griffiths-Ruelle theorem): just take /ii = /i2 in 
(2.65) to conclude that ||$i — ^2\\b° l(j+Const) = 0. In other words, if // is a Gibbs 
measure for interactions $i, $2 £ ■S^, then $1 and $2 must be physically equivalent in 
the Ruelle sense. Of course, we already knew this (Corollary 2.18 plus Theorem 2.42). 

It is curious that although $1, $2 are required to belong to the "small" Banach space 
B^ , the final estimate is in terms of the B^ j J norm, hence much stronger. The reason 
is that $1, $2 £ B^ is needed in order to ensure that the boundary energy contributions 
are indeed o(|A|); but once this is done, then the bulk energy contribution is determined 
by the B^ j J norm, as in Proposition 2.44(b). 

2.4.9 Translation-Invariant Specifications and Gibbs Measures 

We can now examine the theory of specifications and Gibbs measures under the hy- 
pothesis of translation invariance. 
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Definition 2.47 A specification 11 = (7rA)Ae5 is said to be translation-invariant if 

TTAiiO.A) = TTA+aiTaiO.TaA) (2.66) 

for all A e S, u e^, A e J^ and a ^TK 

In particular, if $ is a translation-invariant (and convergent, //^-admissible) interaction, 
then n* is obviously a translation-invariant specification. 

Fix a translation-invariant specification 11. We denote by Qinv(J^) = ^(H) fl 
^-l-i,mij(f^) the set of all translation-invariant measures consistent with 11. Qinv(J^) 
is a convex set, and its extreme points are characterized by the following theorem: 

Proposition 2.48 Let Jl he a translation-invariant specification. Then: 

(a) A measure fi G Qinv(^) is extremal in Qinv{^) if 'ind only if it is extremal in 
M+\^inv{^), i-S- if and only if it is ergodic. 

(b) Qinv(^) is a face of M^i^inv{^)-' that is, if fi^v G Af_|_i^m^,(0) and < A < f are 
such that Xfi -\- [1 — X)p G Qinv(^), then in fact /«, ^^ G Sinv(^)- 

It is now the right moment to make some remarks that may at first seem pedantic, 
but could actually be helpful to people haunted by an (unfortunately established) 
terminology that is confusing or at least bothersomely subtle. The situation is as 
follows. If the specification 11 is translation-invariant, we have at our disposal two 
different spaces of measures of physical interest: 

• ^(n), the space of all measures consistent with 11, whether or not they are 
translation-invariant; and 

• ^mij(n), the space of all translation-invariant measures consistent with 11. 

Physical "macrostates" are interpreted as extremal measures, but the question is: ex- 
tremal in which space? It is important to observe that we have three possibilities: 

(i) The extremal points of ^(H). These measures are characterized by very strict 
properties (Proposition 2.19): they show no fiuctuations for the observables mea- 
surable at infinity ("global observables") — which, for translation-invariant mea- 
sures, is larger than the set of translation-invariant observables ("macroscopic 
observables") — and they exhibit very strong cluster properties (short-range cor- 
relations). 

(ii) The translation-invariant extremal points of ^(H). This is often a small set, and 
in many cases it is empty. For example, in the two-dimensional Ising antiferro- 
magnet at low temperature, there are only two extremal Gibbs measures: one 
has -|- magnetization on the even sublattice and — magnetization on the odd sub- 
lattice (let us call this measure /U±), and the other has the reverse magnetization 
(call this measure /Uq:). Neither of these two measures is translation-invariant, so 
the set in question is empty. More dramatically, there are examples due to van 
Fnter and Miekisz [356] in which there are not even any periodic extremal Gibbs 
measures. 
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(iii) The extremal points of Qinv(^)- This is a much larger set than the one dis- 
cussed in (ii). In particular it is never empty for compact-spin models (see be- 
low). For instance, in the example of the Ising antiferromagnet, there is only 
one translation-invariant Gibbs measure — |(/W± + /^=f) — which is obviously 
extremal in Qinv(^) but not in ^(H). These measures satisfy the comparatively 
weaker properties of Proposition 2.30: they are deterministic for the smaller set 
of translation-invariant observables, and they exhibit the cluster property only 
in the weakest (Cesaro-averaged) sense, namely ergodicity. In the mathematical 
statistical-mechanics literature, these measures — the extremal points of ^^^^(n), 
or equivalently the ergodic elements of Qinv(^) — are called pure phases for the 
specification II. (Unfortunately, the term "pure phase" is sometimes used with 
different but closely related meanings: see e.g. Appendix B.3.I.) 

Which set is interpreted as representing the physical "macrostates" is a problem- 
dependent issue. In problems where non-translation-invariant measures are relevant 
(interfaces, surface tension, crystal shape, wetting, systems with disorder, quasicrys- 
tals), it is mandatory to consider the set ^(11) of all Gibbs measures. Then the 
"macrostates" should correspond to the measures in (i), and the translation-invariant 
"macrostates" should correspond to the measures in (ii). On the other hand, if one lim- 
its oneself to measuring bulk observables (i.e. macroscopic averages), then it is natural 
to consider only the translation-invariant Gibbs measures Qinv(^) and their extreme 
points: that is, (iii) is the natural choice [(ii) being often too small, e.g. empty]. In 
this regard, the use of the catchy label "pure phases" for the measures in (iii) is on the 
one hand natural, given the traditional interest in "macrostates" with symmetry under 
translations, but on the other hand unfortunate for the current interest in more general 
phenomena. A nomenclature more consistent with our purposes could be to call ex- 
tremal Gibbs measures those in (i), translation-invariant extremal Gibbs measures those 
in (ii), and just ergodic Gibbs measures those of (iii) [or extremal translation-invariant 
Gibbs measures, provided that we pay attention to the subtleties of word-ordering]. In 
any case, in the remainder of this paper we shall use the term "phase" or "pure phase" 
to denote the measures in (iii), with one exception: in Appendix B (and only there!) 
we shall succumb to the customary terminology of Pirogov-Sinai theory (as well as 
brevity) and use the term "pure phase" to denote the measures in (ii) [in fact a slight 
generalization of them]. 

Regarding the conditions under which the set Qinv{^) is non-empty, it suffices to 
mention a result analogous to Proposition 2.21 (Section 2.3.6): 

Proposition 2.49 Let ft be a compact metric space, and let II = (7rA)Ae5 be a 
translation-invariant Feller specification. Then Qinv{^) is nonempty. 

Because the translations form an Abelian group, this is an immediate consequence 
of Proposition 2.21 and the Markov- Kakutani theorem [104, 306]. The idea is that, 
given a measure // G ^(H), we can construct a measure in Qinv{^) by averaging // over 
translations (and extracting, if necessary, a convergent subsequence). 
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Remark. One would like to have a translation-invariant version of the Gibbs Rep- 
resentation Theorem (Theorem 2.12). That is, if 11 is a quasilocal, uniformly nonnuU 
and translation-invariant specification, one would like to prove that there exists an 
absolutely summable translation-invariant interaction $ such that 11 = 11*. However, 
it seems to be an open question whether this is true or not. Sullivan [336, Corollary 
to Theorem 2] constructed a translation-invariant $ which is ^^ relatively absolutely 
summable" (see Remark 2 at the end of Section 2.3.3), while Kozlov [222, Theorem 
3] constructed a translation-invariant absolutely summable $ under a condition on 11 
stronger than quasilocality."^° 

2.5 Entropy, Large Deviations and the Variational Principle: 
Finite- Volume Case 

31 

We now begin the study of the second approach to classical statistical mechanics, 
namely the one based on the variational principle^ which states that the Boltzmann- 
Gibbs measure is the one that maximizes entropy minus mean energy. The theory 
developed in this section is applicable to an arbitrary classical-statistical-mechanical 
system for which the Hamiltonian H makes sense. In practice this usually means a 
finite- volume system. First we introduce the free energy; next we introduce the concept 
of relative entropy and its interpretation in terms of large deviations; finally we prove 
the variational principle that connects these two quantities. In Section 2.6 we will 
develop the analogous theory for translation-invariant infinite- volume lattice systems. 

In this section we are working in a completely general classical-statistical-mechanical 
(= probabilistic) context: (0, E) is an arbitrary measurable space. 

2.5.1 Free Energy 

Definition 2.50 Let p be a probability measure on (0,E), and let f be a bounded 
measurable function on ft. We then define 

P{f\p) = log feUp . (2.67) 

Physically, P[f\p) is minus the free energy for a system with Hamiltonian H = —f 
and a priori measure p. Our choice of sign convention makes the formulae slightly 
more elegant. 

It is easy to prove the following properties of the free energy: 

Proposition 2.51 Let p be a probability measure on (0,E). Then P{-\p) has the 
following properties: 



^"^Kozlov's Theorem 3 uses (at least in the English translation) the words "necessary and sufficient", 
but in fact he proves only the sufficiency. 

^^References for this section are Georgii [157, Section 15.1], Israel [206, Section 1.2 and II. 2], Preston 
[299, Chapter 7] and Ellis [110, Chapters I, II, VII and VIII]. 
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(a) P(0|z/) = 0. 

(b) f<g =^ P{fW)<P{gW). 

(c) P[f + c\p) = P[f\p) + c for any real number c. 

(d) P{f\v) — P{g\v) < 11/ — g\\co- That is, P( • \p) is Lipschitz continuous with 
Lipschitz constant 1. 

(e) P{-\v) is convex. 

(f) P{' k) ^-^ strictly convex in directions corresponding to functions which are not 
v-a.e. constant. 

2.5.2 Relative Entropy 

Definition 2.52 Let fi and v he any two probability measures on (0,E). Then the 
relative entropy (or information gain or KuUback-Leibler information^ of fi relative to 
V is defined as 

/(/i|z/) = }J \ ^d^J f"' J Kdu ^du) J^'^ (2.68) 

I +00 otherwise 

More generally, if A is any sub-a-field ofYj, then we define 

Ia{ij]v) = lU\A u\A) . (2.69) 



Actually, our I[fi\p) is the negative of the usual relative entropy S'(/i|z/); but it is more 
convenient to work with / than with S*, and it is too cumbersome to keep saying the 
words "negative of". So we shall just call / the "relative entropy" tout court. But this 
sign difference should be borne in mind when interpreting the variational principle! 
(See also the Remarks at the end of this subsection for a comparison with the usual 
physicists' entropy.) 

It is not hard to prove the following properties of the relative entropy: 
Proposition 2.53 Let //, ;/ be probability mea.sures on (0,E). Then: 

(a) < I{lJi\v) < Lmax = -\ogv,rmn, whcrc v,rmn = mf v{A). [For cxamplc, if P 

is normalized counting mea.sure on a finite .space 0, then Imax = log |0|. // is 
an infinite .space, then Imax = +oo-7 

(b) I{jj\v) = if and only if fi = v. 

(c) I{jj\v) is a convex function of the pair (fi^p). 
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(d) For fixed v, I{ii\v) is ^^almosV^ a concave function of ji, in the sense that 

n n n 

I[Y.\,^i,\v) > ^A,/(/i,|//) + ^AaogA, (2.70a) 

8=1 8=1 

n 

> ^A,/(/i,|//) - logn (2.70b) 



8 = 1 8 = 1 8 = 1 



8 = 1 



for any probability measures /ii, . . . , //„ and numbers Ai, . . . , A„ > with Yl^=i ^i = 
1. 

(e) For fixed v, I{jj\v) is a lower semicontinuous function of fi in the bounded mea- 
surable topologif"^ , and in the weak topology if is a complete separable metric 
space. 

(f) For fixed v and fixed c < oo, the set {fi: I[fi\p) < c} is compact and sequentially 
compact in the bounded measurable topology (and hence also in the weak topology). 

(9) lA{f'W) is an increasing function of A. 

(h) If A\ C A2 C S, and //^ (resp. Vj^ ) is a regular conditional probability for fi 
(resp. v) given A\, then 



UM^) = UMv) + J dfi{u) I^.ifiXKJ ■ (2-71) 

[This obviously refines (g).] 

(i) (Strong superadditivity) Let A\^ A^^ Az be sub-a -fields ofYj which are independent 
with respect to v. Then 



Remarks. 1. The standard statistical-mechanics textbooks (e.g. [213, Chapters 2, 
4 and 5], [307, Section 9.B], [20, Chapter 3]) introduce a quantity which is apparently 
the entropy of a single measure //, without reference to a base measure p: 

{—J2 ti-cu log ficu if ^ is discrete 

r . . .X . . . (2.73) 

— J fi[x) log fi[x) dx if is continuous 

However, closer examination reveals that a base measure v has been introduced surrep- 
titiously in these formulae, namely counting measure in the discrete case or Lebesgue 
measure in the continuous case. This base measure does play a physical role in the 
theory: the physics would be different if counting or Lebesgue measure were replaced 



^^Recall that a net {//«} converges to ft in the bounded measurable topology if J f dfia -^ f f dfi 
for all/ e 5(fi,S). 
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by some other measure. "^"^ Thus, the formulae (2.73), in which the base measure v is 
hidden, are quite misleading. (They are also inelegant, as can be seen from the in- 
compatible treatment given to the discrete and continuous cases.) What is involved 
here is the common sin of failing to distinguish between a measure and a density 
(= Radon-Nikodym derivative): the latter is defined only relative to a specified base 
measure. In many situations, this sin is harmless, because there is a "natural" and 
universally-agreed choice of base measure. But not here. We therefore feel strongly 
that in statistical mechanics the base measure v should be introduced explicitly. 

Note also that the definition (2.73) uses unnormalized counting or Lebesgue measure 
as the base measure, while we always take the base measure ;/ to be a probability 
measure. This causes an (irrelevant) additive shift in the entropy: e.g. for finite, 

/(/i|//) = -Sbooksip^) + log |0| (2.74) 

when p is normalized counting measure [//({cu}) = 1/|0| for each io ^ ft]. Thus, both 
I{fi\p) and Shooks{f') take values in the interval [O,log |0|], but large values of I{jj\v) 
correspond to small values of Shooks{f')^ and vice versa. 

The reader is urged to remember the two notational differences — the sign and the 
additive constant — when interpreting our results. 

2. The relative entropy I{jj\v) plays an important role in information theory and 
in mathematical statistics (large-sample asymptotic theory of hypothesis testing and 
maximum-likelihood estimation); this follows from the large-deviations theory to be 
discussed in the next subsection. See e.g. [30, pp. II9-I25] and [226, 225, 16]. The 
relationship with maximum-likelihood estimation is discussed also in Section 5.1.2 be- 
low. 

2.5.3 Large Deviations 

The physical interpretation of relative entropy is associated with the problem of large 
deviations , which concerns, roughly speaking, the estimation of the (very small) prob- 
abilities of large simultaneous fiuctuations in a system consisting of a large number of 
random variables. In this section we will consider the case of independent, identically 
distributed (i.i.d.) random variables. So let Xi,X2, ... be a sequence of independent 
samples from the probability distribution v] and let / be any bounded real- valued mea- 
surable function on 0. Then /(Xi), /(X2), ... is a sequence of independent, identically 
distributed real- valued random variables. In such a situation the weak law of large 
numbers states that the sample mean Sl^ = n~^ J2^=i fi^i) is, with high probability, 
very close to the theoretical mean value m = Jf dp: more precisely, if A is any closed 



^^In some cases, counting or Lebesgue measure may play a privileged role by virtue of some sym- 
metry: e.g. spin-flip symmetry in the Ising model, or symplectic symmetry in a classical Hamiltonian 
system. In other cases, however, the privileged measure could be some other measure: e.g. Haar 
measure on a Lie group is not Lebesgue measure except in some very special parametrizations. This 
is yet another reason for making the base measure i> explicit: it clarifies whether or not there is a 
symmetry argument that privileges one choice of i> over another. 
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subset of the real line not containing m, then Prob(S';{ G A) — )• as n — )• oo. Large- 
deviation theorems [360, 110, 75] are a strengthening of the weak law of large numbers, 
in that they give the precise rate of convergence of this probability to zero as n — )• oo. 
It turns out that this probability is exponentially small in n, that is, 

PlohiSl G A) - ,-nXconst(f,.,A) 



~ e 



(2.75) 



where const (/,;/, A) > whenever A is a closed set not containing m. More precisely, 
it can be shown that 



1 



lim — logProb(S';( G A) 



n — >oo 12 



< 



> 



inf I[fi\p) if A is a closed set 

IJ.: ffd/ieA 

inf I[fi\p) if A is an open set 



(2.76) 



where I{jj\v) is the relative entropy. 

In the preceding thought-experiment, we looked at only one real- valued observable 
/. More generally, we could look at a vector-valued observable f = (/i, . . . ,/fc), and 
ask for the probability that S\ lies in some subset A C R . Not surprisingly we have 

if A is a closed set 



1 f 

lim -logProb(5'„ G A) 



< 



> 



inf I[fi\p) 

/i: JfdfieA 

inf I[fi\p) if A is an open set 

/i: JfdfieA 



(2.77) 



These results can be written in a more succinct way by noting the trivial identity 



1 



E/(^»- 



1 



E^xMf) 



(2.78) 

(2.79) 
(2.80) 



8 = 1 \ 8 = 1 / 

(here 6^ is the delta measure at x), which can be written as 

Si = Uf) = JfdL^ 
where 

n 

8 = 1 

is called the empirical measure. We emphasize that Ln is a random, measure: it depends 
on the random sample Xi, . . . ,X„. In this language, the weak law of large numbers 
can be reformulated as saying that the empirical measure Ln is, with high probability, 
very close to the theoretical measure ;/, when "closeness" is understood in the bounded 
measurable topology (that is, the weak topology generated by the bounded measurable 
functions). More precisely, if A is any closed subset of Af_|_i(0) not containing ;/, then 
Prob(iv„ G A) — )• as n — )• oo."^'* The large-deviation theorem [176, 69, 34] then states 
that this probability is in fact exponentially small in n, namely 

— riXconst(i^,A) 



Prob(iv„ G A) 



2.81 



34 



In this particular topology, a basis for the neighborhoods of i> is given by the sets 



5 f 



fi d^i- I fi dv 



< e for all i = 1, 
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where const(z/, A) > whenever A is a closed set of measures not containing v. More 
precisely, 



lim — logProb(iv„ G A) < 



' < — inf I{jj\v) if A is a closed set 

/i: /igA 

> — inf I{jj\v) if A is an open set 

/i: /igA 



f2.82) 



In fact, this result is merely a sophisticated restatement of (2.77), since every closed 
(resp. open) set of measures A is contained in (resp. contains) one of the form {//: / f J/i G 
A} for some f = (/i, . . . , fk) and some A closed (resp. open) C R . 

Formulas (2.81)/(2.82) provide a physical interpretation of the relative entropy. 
Indeed, we can say (roughly speaking) that the probability that a sample Xi, . . . ,X„, 
taken from the probability distribution ;/, "looks like a typical sample from //" decays 
exponentially with rate I{jj\v): 



Probj,(Xi, . . . , Xn is typical for //) ~ e 



-nl(^i\v) 



(2.83) 



In the probabilistic literature, (2.76)/(2.77) are called level-1 large-deviation for- 
mulae, and (2.82) is called a level-2 large-deviation formula. 

2.5.4 Variational Principle 

The free energy and the relative entropy are related by the following variational prin- 
ciple: 

Theorem 2.54 (Variational principle) Fix a probability measure u o?i(0,E). Then 
P[- \p) and I[- \p) are conjugate convex functions, in the sense that 



PiJW) 



sup 

AtgM+i(fi,S) 



sup 

/gB(fi,S) L 



j fdfi - I{ji\i 
Jfdfi - P{f\j.) 



(2. 84a) 



(2.84b) 



Moreover, the supremum is achieved if and only if fi equals the Boltzmann-Gibbs mea- 
sure for Hamiltonian H = — f (and a priori measure v), namely 



f'BGJ,! 



e^ du 
J e^ dp 



(2.85) 



where f = (/i, . . . , fk) runs over all finite families of bounded measurable functions, and e runs over 
all strictly positive numbers. By the usual weak law of large numbers we have 

k 

Prob(i.„ ^ B^fJ < J2Proh{\Sl' - mil > ^ 

8 = 1 

as n ^ cxD, since k is finite. Since any closed set A ^ i^ is contained in the complement of some set 
B f , the claim Prob(_L„ G A) ^ is proven. 
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This complementary pair of variational principles establishes the equivalence of (2.1) 
and (2.3) for finite-volume statistical-mechanical systems. Indeed, Jfdfi is minus the 
mean energy for a system with Hamiltonian H = — /, and I[fi\p) is minus the entropy; 
therefore, (2.84a) states that the Boltzmann-Gibbs measure is the one that minimizes 
energy minus entropy, and that the minimum value of energy minus entropy equals 
the free energy. (In thermodynamic notation, F = E — TS; recall that we are taking 
/3 = 1.) 

2.6 Entropy, Large Deviations and the Variational Principle: 
Infinite- Volume Case 

35 

The variational approach developed in the preceding section is adequate for finite- 
volume statistical-mechanical systems, in which the Hamiltonian H is well-defined and 
finite. But it is (not surprisingly) insufficient for the infinite- volume case, in which all 
the relevant quantities — Hamiltonian, free energy, mean energy and relative entropy 
— are almost certainly infinite. Nevertheless, one might hope that for translation- 
invariant infinite- volume systems there would exist an analogous theory in which the 
concepts of free energy, mean energy and relative entropy are replaced by these same 
quantities per unit volume; one could then define an equilibrium, measure to be a 
translation-invariant measure that maximizes the entropy density minus mean energy 
density. In this section we shall develop such a theory. But this infinite- volume theory 
is considerably more subtle than its finite- volume counterpart: this subtlety arises from 
the physical possibility of phase transitions, as well as from additional mathematical 
pathologies to be explained in Section 2.6.7 below. 

The variational approach to infinite- volume lattice systems is less general than the 
one based on the DLR equations, because of its restriction to translation-invariant 
measures"^^, but within its restricted domain it is equivalent to the DLR theory: the 
key theorem (Corollary 2.68) states that, for any interaction $ G B^ , the equilibrium 
measures coincide with the translation-invariant Gibbs measures. 

2.6.1 Free Energy Density ("Pressure") 

We look first at the free energy density, or what is equivalent, the "pressure": 

Definition 2.55 Let v he a translation-invariant probability measure on ft = (Oq)^ , 
and let f be a bounded measurable function. Then the pressure of / relative to v is 



^^References for this section are Georgii [157, Chapters 15 and 16], Israel [206, Chapters I, II and 
V], Preston [299, Chapters 7 and 8], Ruelle [313, Chapters 3 and 4] and Ellis [110, Chapters IV and 
V and Appendix C]. 

^®Even if the interaction is translation-invariant, there may exist non-translation-invariant Gibbs 
measures (e.g. for the Ising model in dimension rf > 3 [85, 348]), and these are of interest in describing 
interfaces. 
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defined as 



K/k) = i™ — loglexp 



xec'n 



dp 



(2.86) 



if this limit exists. Similarly, if $ is an interaction in B^ , then the pressure of $ 
relative to v is defined as 



1 r 



(2.87) 



if this limit exists 



This quantity should really be called "minus the free energy density" . The term "pres- 
sure" arises from the interpretation of the canonical-ensemble Ising model as equivalent 
to a grand-canonical-ensemble lattice gas; in the general case the term "pressure" is 
not really appropriate, but it has become standard among mathematical physicists. It 
has, at least, the virtue of brevity. 

We emphasize that the existence of the limit (2.86) [or (2.87)] is a nontrivial prob- 
lem; in fact, there exist examples of translation-invariant measures v for which the limit 
does not exist, even for simple local functions / (see Appendix A. 5. 2). Therefore, we 
shall restrict attention to two cases: when v is a, product measure, and more generally, 
when V is a, Gibbs measure for a translation-invariant interaction. 

Proposition 2.56 Let v he a product measure. Then the pressure p{f\v) exists for all 
hounded quasilocal functions f ; in fact, the limit exists also in van Hove sense, namely 



p{f\i 



lini TTT log / exp 

A/oo A J 



.xeA 



dp 



(2.88) 



Moreover, p[- \p) has the following properties: 

(a) p{0\u) = 0. 

(b) f<g =^ p{fW)<p{9W). 

(c) p[f -\- c\p) = p{f\i') + c for any real numher c. 

(d) p{f\i') — p{g\v) < 11/ — g\\oo- That is, p{-\v) is Lipschitz continuous with 
Lipschitz constant 1. 

(e) p[f -\- h\p) = p{f\i') for any h ^ X. 

(f) p{' \^) ^-^ convex. 

We emphasize, in particular, part (e): the pressure is constant within "subspaces of 
physical equivalence" . 
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Proposition 2.57 Let p be a translation-invariant Gibbs measure for an interaction 
$ G S^ (and a priori measure ji^). Then the pressure p{f\v) exists for all bounded 
quasilocal functions f ; in fact, the limit exists also in van Hove sense, namely 



PifW) = lip TTTl°g / ^^P 

A/oo A J 



.xeA 



dv . (2.89) 



Moreover, the limit is given by 

p{fW) = p{f - M/) - K-M/) • (2.90) 

In particular, p[ ■ \p) has all the properties (a)-(f) of Proposition 2.56. 

The pressure of a function f is the simplest object from a mathematical point of 
view, but the pressure of an interaction $ is perhaps more familiar to physicists. In 
fact these two objects are essentially identical: 

Proposition 2.58 Let $ G S°, and let p be a translation-invariant mea.sure satisfying 
the conditions of Proposition 2.56 or 2.51. Then: 

(a) p[^\p) exists and equals p[—f^\p). In fact, the limit exists also in van Hove 
sense, i.e. 



Vm^) = lim ^log /exp [-</.ee] d^ ■ (2-91) 



(b) If in addition $ G B^ , then for any r G 0, lim j-r-r log / exp — i7^^ du also 
exists and equals p[—f^\p). 

Part (b) states that, for interactions $ G S^, the pressure is independent of boundary 
conditions. 

The reader will note that we have not asserted the strict convexity of p[- |z/); this is 
because, in sharp contrast to the finite- volume case, the infinite- volume pressure is not 
strictly convex (not even modulo physical equivalence). Indeed, this failure of strict 
convexity is at the origin of some rather surprising pathologies of the infinite- volume 
variational theory in the "large" space of interactions S° (see Section 2.6.7 below). 
However, in the smaller space B^ these pathologies do not arise: 

Proposition 2.59 (Griffiths— Ruelle [174]) Let u be a translation-invariant mea- 
sure satisfying the conditions of Proposition 2.56 or 2.57. Then the pressure p[- \p), 
restricted to the space of interactions B^ , is strictly convex in directions ^ J -\- Const. 
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Note, in particular, the contrapositive of this proposition: ii p{- {p) is not strictly 
convex on B^ in directions ^ J -\- Const, then v is not the Gibbs measure for any 
interaction in B^. This gives a method for proving non-Gibbsianness, which will be 
exploited in Section 4.4. 

Remark. The failure of strict convexity in S° was first pointed out by Fisher 
[116], who provided a family of exactly soluble one-dimensional Ising models in which 
the pressure can be explicitly seen to have straight segments. These models are lattice 
versions of the Fisher-Felderhof [120, 121, 114, 113] cluster models. The failure of strict 
convexity can here be given a physical interpretation in terms of the formation of a 
perfectly rigid crystal. This indicates that S° \ B^ does contain some interactions of 
physical interest, if only for their rather strange thermodynamic properties. 

2.6.2 Relative Entropy Density 

For brevity we henceforth write the relative entropy in volume A as /a(/u|z^), instead of 
the more pedantic /jr^(/i|z/). We now define the relative entropy density: 

Definition 2.60 Let //, ly be translation- invariant probability measures on 0, = (Oq) . 

The relative entropy density (or relative entropy per unit volumej of fi relative to p is 

defined as 

1 

n — >oo fi 

if this limit exists. 



i{lj]v) = Yiva —IcM^) (2.92) 



We emphasize that the existence of the limit (2.92) is a nontrivial problem; in fact, 
there exist examples of translation-invariant measures //, v for which the limit does 
not exist (see Appendix A. 5. 2). Therefore, just as for the pressure, we shall restrict 
attention to two cases: when v is a, product measure, and more generally, when v is a, 
Gibbs measure for a translation-invariant interaction. 

Proposition 2.61 Let v be a product measure. Then the relative entropy density 
i{jj\v) exists for all translation-invariant probability measures fi; in fact, the limit exists 
in van Hove sense and also as a supremum: 

i{ji\v) = lim ^/A(/ik) (2.93a) 

A/^oo |A| 

= sup -—I{^{ji\v) . (2.93b) 

Aes |A| 

Moreover, i{fi\i') has the following properties: 

(a) < i{jJ,\p) < imax = -\ogv,rmnfi, whcrc p^tufl = mf i^{A). [For example, 

if V is the product of normalized counting measure on a finite single-spin space 
Oo; then imax = log |0o|. If the single-spin space Oq is infinite, then imax = +oo-7 
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(b) ?(//|/^) ^5 an affine function of ji, i.e. 

n n 

Va,//, |z/) = ^A,z(/i,|z/) (2.94) 



8=1 8=1 



for any measures fii^...^fin £ M^nnvi^) and numbers Ai,...,A„ > with 

Er=iA8 = i. 

(c) For fixed v, i{jj\v) is a lower semicontinuous function of fi in the bounded quasilo- 
cal topologip^ , and in the weak quasilocal topology^^ if Oq i-s a complete separable 
metric space. 

(d) For any fi, there exists a sequence {fin)n>i such that fin ^^ f- in the bounded 
quasilocal topology, and z(/i„|z/) = imax for all n. It follows that i{fi\i') is a 
discontinuous function of fi in the bounded quasilocal topology (and hence also in 
the weak quasilocal topology) at each fi satisfying i{fi\i') < imax- 

(e) For any fi, there exists a sequence {fin)n>i o/ ergo die measures such that fin — ^ 
fi in the bounded quasilocal topology, and z(/i„|z/) | i[fi\p). [This strengthens 
Proposition 2.32.] 

(f) For fixed v and fixed c < oo, the set {fi: i{fi\i') < c} is compact and sequentially 
compact in the bounded quasilocal topology (and hence also in the weak quasilocal 
topology), at least if fto is a complete separable metric .space. 

It is quite remarkable that the relative entropy density z( • \v) is an affine function. 
This comes from the fact that the relative entropy I{-\v) is not only convex, but also 
concave within a A-independent additive constant; and this constant disappears when 
considering the entropy per unit volume in the infinite- volume limit. This affineness of 
z( • \v) makes the infinite- volume variational theory quite different from its finite- volume 
counterpart. 

Proposition 2.62 Let v be a translation-invariant Gibbs mea.sure for an interaction 
$ G S^ (and a priori mea.sure fi^). Then the relative entropy density i{fi\i') exists for 
all translation-invariant probability mea.sures fir, in fact, the limit exists also in van 
Hove sense, namely 

i{fi\v) = lim ^/A(/ik) . (2.95) 



^^Recall that a net {//«} converges to fi in the bounded quasilocal topology if J f dfia -^ J f dfi for 
all / G Bqi(Q). In Georgii [157], this topology is called the "topology of local convergence" or the 
"£-topology" . See [157, Chapter 4] for properties of this topology. 

^^Recall that a net {//«} converges to ft in the weak quasilocal topology if ffdfia -^ ff dfi for 
all / G Cqi(^). If ilo is a compact metric space, then Cqi(^) = C(il), and so the weak quasilocal 
topology coincides with the usual weak topology. 
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Moreover, this limit is given by 

z(/i|z/) = z(/u|/U°) + p(-/$|/«°) + UdfJ.. (2.96) 

Moreover, z( • \p) has properties (h) and (c) of Proposition 2.61. 

Note that, by (2.96), the relative entropy density z( • \v) depends on v only via the 
interaction $: that is, if Vi and V2 are translation-invariant Gibbs measures for the 
same interaction $ G B^ , we have i{jj\vi) = i{jj\v2) for all //. 

The reader will note that we have not asserted that i{jj\v) = if and only if // = ;/. 
Indeed, this naive conjecture is false: as we have just seen, i{jj\v) = also holds 
whenever // and v are translation-invariant Gibbs measures for the same interaction. 
In Section 2.6.6 we shall show that, roughly speaking, i{jj\v) = only when // and v 
are Gibbs measures for the same interaction. This fact will play a crucial role in the 
proof of the First Fundamental Theorem (see Section 3.2). 

Remark. We have proven the existence of i{jj\v) when v is a, Gibbs measure, 
but this does not exhaust the cases for which i{jj\v) exists. Indeed, by combining 
Theorem 3.4 with our construction in Section 4.1, we provide an explicit example of 
non-Gibbsian translation-invariant measures // and v for which i{jj\v) exists (and is 
in fact zero): namely, // (resp. v) is the image of the -|- (resp. — ) phase of the two- 
dimensional Ising model (at low enough temperature) under the 6 = 2 decimation 
transformation. It is an interesting (and probably difficult) mathematical problem to 
characterize the pairs {jJ^^v) for which i{jj\v) exists. 

2.6.3 Large Deviations 

In Section 2.5.3 we developed the theory of large deviations for independent repetitions 
of an arbitrary probabilistic experiment. This theory provided a physical (and statis- 
tical) interpretation for the concept of relative entropy. It is natural to ask whether 
there is an analogue, for translation-invariant measures on a classical lattice system, in 
which "time averages" are replaced by "space averages". That is, instead of considering 
large deviations for the sample mean in a large number of independent repetitions of 
the same experiment, one might instead consider large deviations from .spatial means 
(physically, large fluctuations of extensive quantities) in a single infinite-volume re- 
alization. Such a large-deviation theory would then, it is hoped, provide a physical 
interpretation of the relative entropy density. 

In this section we describe (without proof!) the basic features of such a large- 
deviation theory. We emphasize that this theory is much more subtle than the theory 
for the independent-repetitions case, because the spins in disjoint regions of space need 
not be probabilistically independent. Indeed, for general translation-invariant measures 
on 0, no satisfactory large-deviation theory is known. Therefore, we shall restrict 
attention to the case in which p is an ergodic translation-invariant Gibbs measure for 
an interaction $ G S^. Our exposition is based on the recent work of FoUmer and Orey 
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[124], OUa [282, 283], Comets [65] and Georgii [155] (see also [51]), which in turn is 
inspired by the pioneering work of Donsker and Varadhan [99, 100, 101, 102, 360]. In 
the physics hterature, the relation between thermodynamics and large deviations was 
pointed out long ago by Lanford [230]. 

If / is a bounded measurable function on 0, then the mean ergodic theorem states 
that the spatial averages S'l = |A|~^ J2aeA Taf converge in L^{}i) norm to the expected 
value m = jfdv, as A /* oo. In particular, if A is any closed subset of the real 
line not containing m^ then Prob(S'^ G A) — )• as A /* oo. The mean ergodic 
theorem is, therefore, a natural generalization of the weak law of large numbers. The 
large-deviation theorems strengthen the ergodic theorem by giving a precise rate of 
convergence of Prob(S';( G A) to zero as A /* oo. 

If we first restrict attention to single-site observables / (i.e. functions of a single 
spin), then the large-deviation theorems for spatial averages (level 1) and for the single- 
site empirical measure (level 2) are direct analogues of (2.76) and (2.82):"^^ 



1 . 

lim — — ■ logProb(S'^ G A) 

A/'oo A 



< — inf Kf'W) if A is a closed set 

> — inf Kf'W) if A is an open set 



(2.97) 



and 



hm --- log FrobfivA G A) 

A/oo A ^ ' 



< — inf ^(/^l'^) if A is a closed set 

/ieM+i_,„„(fi): /i |"^{o}eA 

> — inf ^(/^l'^) if A is an open set 

/ieM+i_,„„(fi): /i |"^{o}eA 

(2.98) 
where i{fi\i') is the relative entropy density, and for each configuration io the single-site 
empirical measure in volume A is defined to be L^ = |A|~^ Y^ieA ^tur 

The empirical measure L^ is a tool for studying events occurring at a single site 
only. These events would completely characterize the measure if it were a product 
measure (as in the i.i.d. case studied in Section 2.5.3), but in the general case one 
clearly needs multi-site observables (i.e. functions of several spins) in order to describe 
correlations. The study of such observables gives rise to the "level-3" large-deviation 
theory. It is based on the trivial identity 



^E(r./)(c.) = f^E'^TJ(/), 

si = i?A(/) = JfdRA (2.100) 



_ _ (2.99) 

^^1 aeA M^^l aeA / 

which can be written as 



where 



Ra = \A\-'J2^T.^ (2-101) 

aeA 



^^In the mathematical literature the large-deviation theorems are usually proven for sequences of 
cubes, but the same arguments ought to work for general van Hove sequences. 

67 



is called the empirical field. We emphasize that R^ is a random measure (on the 
infinite- volume configuration space 0), since it depends on the random configuration 
id. In this language, the ergodic theorem can be reformulated as implying that the em- 
pirical field i?A is, with high probability, very close to the theoretical measure ;/, when 
"closeness" is understood in the bounded quasilocal topology (i.e. the weak topology 
generated by the bounded quasilocal functions). More precisely, if A is any closed subset 
of Af_|_i(0) not containing ;/, then Prob(i?A G A) — )• as A /* oo. The large-deviation 
theorem [155] then states that this probability is in fact exponentially small in |A|, 
namely 

Prob(i?A G A) ~ e-\Mconst{.,K) ^2.102) 

where const(z/. A) > whenever A is a closed subset of Af_|_i(0) not containing p. In 
detail. 



lim --- logProb(i?A G A) < 

A/oo A ° ^ ^ 



' < — inf ^(/^l'^) if A is a closed set 

At: AteAnM+i_,„„(fi) 

> — inf ^(/^l'^) if A is an open set 



(2.103) 



These formulae provide a physical interpretation for the relative entropy density. 
Roughly speaking, the probability that a configuration cu, taken from the probability 
distribution ;/, "looks in A like a typical configuration from //" decays exponentially in 
the volume of A with rate i[fi\p): 



Prob 



j,(cua is typical for //) 



-\MHpW) 



(2.104) 



This interpretation of the relative entropy density will play a key role in motivating 
the First Fundamental Theorem (Section 3.2). 



?(per) 



Remarks. 1. Some of the large-deviation theorems use a periodized empirical field 



(per) 



to 



i?A , which is a translation-invariant measure on 0. One expects i?A and i?^ 
behave in the same way. 

2. Our results in Section 4 give examples of some non-Gibbsian measures p for which 
a large-deviations theory can be developed, e.g. p = pT where p is a two-dimensional 
Ising-model Gibbs measure at low temperature, and T is a suitable renormalization 
map. Of course, one is able to control the large deviations for p only by reducing it to 
the same problem for the better-behaved measure p. 



2.6.4 Variational Principle 

The pressure and the relative entropy density are related by the following variational 
principle: 

Theorem 2.63 (Variational principle) Fix a translation-invariant measure p sat- 
isfying the conditions of Proposition 2.56 or 2.57. Then p{-\p) and z( • \p) are conjugate 
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convex functions, in the sense that 



p{fW) 



sup 



sup 



fdji - i{ji\p) 
fdfj.- p{f\iy) 



r2.105a) 



(2.105b) 



Written in terms of interactions, this reads 






sup 



sup 



fq,dfi 



i{fi\i 



r2.106a) 



(2.106b) 



This variational principle gives us another way to associate (infinite- volume) prob- 
ability measures to a given interaction: 

Definition 2.64 Let $ G S° and ji G Af+i^m^,(0). We say that ji is an equilibrium 
measure for $ (and a priori measure iJ^ ) if the pair {^^fi) saturates the variational 
principle (2.106) with v = /i°, i.e. if 



p($|/) + z(/.|/) 



/$ J/i 



(2.107) 



We have now laid out two distinct approaches to infinite- volume physics: 
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The DLR approach^ which says what it means for a (not necessarily translation- 
invariant) measure // to be a Gibbs mea.sure for a convergent and //^-admissible 
(but not necessarily translation-invariant) interaction $. This is the infinite- 
volume analogue of the explicit formula (2.1). This approach is constructed 
purely on the basis of probability theory, and hence it can be called the .statistical- 
mechanical approach. 

2) The variational approach, which says what it means for a translation-invariant 
measure // to be an equilibrium. mea.sure for a translation-invariant (but not nec- 
essarily convergent) interaction $. This is the infinite- volume analogue of the 
variational principle (2.3). This approach is based on optimization of thermo- 
dynamic potentials, and hence it can be called the thermodynamic approach. 
However, as remarked by Wightman [363], conventional thermodynamics refers 
to the optimization of potentials with respect to a rather reduced number of pa- 
rameters (temperature, chemical potential, etc.). In contrast, the optimization 
of the previous proposition is with respect to an infinite-dimensional space of 
possible interactions. 

For translation-invariant interactions $ and translation-invariant measures //, this 
means in practice the following: The DLR approach applies to a more restricted class of 
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interactions, but in return provides much more information on the measures. That is, 
it requires $ G B^, but gives strong control on fi via the DLR equations (2.21)/(2.22). 
On the other hand, the variational approach needs only $ G S°, but provides much 
weaker control over fi. In any case, the two approaches are equivalent in their common 
domain of applicability: if $ is a translation-invariant interaction in B^ and // is a 
translation-invariant measure, then // is a Gibbs measure for $ if and only if it is an 
equilibrium measure for $. We will prove this in Corollary 2.68 below. 

At this point, the reader may be wondering: If the DLR and variational approaches 
are equivalent (for interactions in B^)^ then why bother introducing both of them? 
Why not stick with one or the other, and shorten this article by at least 30 pages? 
The answer is that many deep results are based on the interplay between DLR and 
variational ideas. This is the case for Theorem 2.67 below, and it is also the case for 
many of our RG results (notably those in Sections 3.2, 3.3 and 4.4). 

Before leaving the subject of the variational principle, let us note a simple corollary. 
Let -F(/U, $) be the amount by which the pair (//, $) fails to satisfy the variational 
principle, i.e. 

F(/i,$) = p($|/) + z(/i|/) + lUdji > 0. (2.108) 

Then it is easy to see that 

|F(/i,$)-F(/i,$')l < 2\\^-^'\\so/^j+const); (2.109) 

indeed, this is an immediate consequence of Propositions 2.56(c)-(e) and 2.58(a). In 
particular, if fi is an equilibrium measure for $, then the amount by which fi fails to 
satisfy the variational principle for $' is at most 2||$ — $'||B''/(j"-i-Cons<)- Note also that 
if $ G S^, then F[fi^ $) can be interpreted as a relative entropy: 

F(/i,$) = z(/i|//) for any // G ^,„4n*) . (2.110) 

This is the content of equation (2.96). 

A special case of (2.109) [which is also easy to see directly] is the following: 

Proposition 2.65 Let $,$ G S° be physically equivalent in the Ruelle sense (i.e. 
$ — $' G JT" -|- Const). Then $ and $' have exactly the same equilibrium. mea.sures. 

2.6.5 What is a Phase Transition? 

Informally, the occurrence of a phase transition is associated to one or both of the fol- 
lowing phenomena: a singularity of some thermodynamic potential and/or the change 
in the number of "macrostates" available to the system. Historically, the first point 
of view was primarily associated with Ehrenfest, while the second point of view was 
primarily associated with Gibbs. However, the full formalization of the second point of 
view — in particular, giving a precise meaning to "macrostate" — and the clarification 
of the relation between these thermodynamic concepts and the underlying (microscopic) 
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statistical-mechanical concepts had to await the development of the DLR and rigorous 
variational approaches. 

The general interpretation of phase transitions as singularities of the (what turned 
out to be infinite- volume) free energy (= pressure) gave rise to the Ehrenfest classifi- 
cation: a system is said to exhibit an n^^ -order phase transition if some n^^ derivative 
of the free energy is discontinuous (and all the derivatives of lower order are contin- 
uous). For example, the two-dimensional Ising model at low temperatures undergoes 
a first-order phase transition as the magnetic field passes through zero, because the 
magnetization (= first derivative of the free energy with respect to the field) has a 
discontinuity. On the other hand, if the field is kept equal to zero and the temperature 
is lowered (starting from a high value), the system undergoes a second-order phase 
transition at the critical temperature, because the magnetization and energy (= first 
derivatives of the free energy) remain continuous but the susceptibility and specific 
heat (= second derivatives of the free energy) blow up. From the point of view of 
mathematical physics, however, the Fhrenfest classification is both too detailed and 
too crude for our current level of understanding. It is too detailed because, as we 
discuss below, only the distinction between first-order and the rest has been put onto 
a firm basis. Consequently, authors usually group all the transitions of order two or 
higher into a single class and call all of them continuous phase transitions — because 
the order parameter, e.g. the magnetization (see below), remains continuous. On the 
other hand, the Fhrenfest classification is too crude, because the possible singularities 
of the free energy are much too varied to be captured in a single integer n. Some 
examples are: 

• The one-dimensional Ising model with 1/r^ interaction, in which it is believed 
[12, 11, 57] that the free energy /(/3, h = 0) is C°° but nonanalytic at the critical 
point /3c, at the same time as the spontaneous magnetization Af (/3, h = 0) = 
—df/dh\fi=o is discontinuous at /3c (Thouless effect) [5]. 

• The two-dimensional XY model (Kosterlitz-Thouless transition), in which it is 
believed [220] that the free energy /(/3, h = 0) is C°° but nonanalytic at /3c; here 
the spontaneous magnetization Af (/3, /i = 0) = —df/dh\fi=o vanishes identically, 
while the zero-field susceptibility x{/^7 h = 0) = —d^f/dh^\h=o is believed to blow 
up at /3c and remain infinite for all /3 > /3c. 

• Systems with disorder, in which it is expected in general (and sometimes proven) 
that at high temperature the free energy is everywhere C°° but nowhere analytic, 
as a function of temperature and/or magnetic field. This phenomenon is known 
as a Griffiths singularity [131]. 

The description of transitions where the number of "macrostates" changes is based 
on the use of order parameters. These are observables acquiring different expectation 
values for the different "macrostates". Fach "macrostate" can be selected either by 
introducing some extra field that is turned off in the limit, or by using the right 
boundary conditions. The connection between this point of view and the existence of 
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singularities in the pressure (free energy) was informally known since the beginning of 
the field: The pressure has to be convex — for the sytem to be stable — hence its only 
possible discontinuities are the existence of "sharp corners" where the various one-sided 
derivatives of the pressure take different values. Each of these values defines a different 
"macrostate". For example, in the case of the Ising model, the right and left derivatives 
with respect to the magnetic field give the two possible magnetizations. One can select 
one of the magnetizations by turning off a positive magnetic field (i.e. coming from 
the right) or a negative one (left limit), or, alternatively, by surrounding the sample by 
spins polarized in the desired form. It turns out that this intuition can be formalized 
in the framework of the variational-principle approach. Using the abstract notion of 
tangent to a convex functional in a Banach space, Gallavotti and Miracle-Sole [139] 
and Lanford and Robinson [231] showed in the mid-I960's how the existence of more 
than one pure phase (ergodic equilibrium measure) is equivalent to lack of (Gateaux) 
differentiability of the pressure (see e.g. [206] or [157, Chapter 16]). Moreover, in 
complete agreement with the above example of the Ising model, the direction in which 
the differentiability fails is precisely the direction of the field conjugate to the relevant 
order parameter, and the different directional derivatives give the expectations of this 
observable in the different pure phases. 

Therefore, if we restrict ourselves to translation-invariant specifications and mea- 
sures, we have the important distinction that first-order phase transitions correspond 
to a change in the number of ergodic equilibrium measures (pure phases), while contin- 
uous transitions do not necessarily change this number and correspond to much more 
subtle phenomena (e.g. slow decay of correlations = fiuctuations propagating over 
macroscopic scales = critical opalescence). The points in parameter space where there 
is a second- (or higher-) order phase transition are customarily called critical points^ 
in analogy to the critical point of liquid-gas systems, which was the earliest-known 
example of this phenomenon. 

For phenomena in which one has to go beyond translation invariance, the connection 
between free-energy singularities and properties of the set of extremal Gibbs measures 
is less clear. Nevertheless, transitions involving a change in the number of extremal 
Gibbs measures are usually called (by analogy rather than logic) "first-order" also in 
this general case. 

Corresponding to the two different notions of "phase transition" mentioned at the 
beginning of this subsection, there are two different types of result on "absence of phase 
transitions": On the one hand, there are results proving the uniqueness of the Gibbs 
measure (|^(n)| = I) [84, 90, 91, 94] or of the translation-invariant Gibbs measure 
(l^mij(n)l = I) [41, 262]. On the other hand, there are results on analyticity of the 
free energy and correlations [140, 205, 286, 88, 93, 96]. In the last two references, 
Dobrushin and Shlosman introduced an extremely strong notion of absence of phase 
transitions, which they call the complete analyticity condition. It corresponds roughly 
to the analyticity of all the finite- volume free energies uniformly in the volume and in 
the boundary conditions . 

It is known that in general the different notions of presence and absence of phase 
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transitions are not equivalent. This non-equivalence is probably due to physical reasons 
in most of the cases, but sometimes it seems an artifact of the mathematical formalism 
[89, 351]. 

2.6.6 When is the Relative Entropy Density Zero? 

We now come to a key question (which will play a crucial role in our RG theory): 
Under what conditions does i{fi\i') = 0? That is, under what conditions is the relative 
entropy in volume A a quantity o(|A|), i.e. a "surface effect"? The answer is simple: 
if ;/ is a Gibbs measure for some interaction, then i{fi\i') = when and only when fi 
is a Gibbs measure for the same interaction. The following two theorems make this 
precise, in a rather strong form: 

Theorem 2.66 Let /ii,/i2 be Gibbs measures (not necessarily translation-invariant) 
for interactions $i,$2 £ 13^, respectively. Then 

limsup-— /A(/ii|/i2) < 2\\^i-^2\\B°/{J+Const) ■ (2.111) 

A/'oo |A| 

// fii and fi2 «^e translation-invariant, this means that 

«(/«l|/«2) < 'i\\^l-^2\Wl{J+Const)- (2.112) 

In particular, if fii and fi2 «^e translation-invariant Gibbs measures for the same in- 
teraction $ G B^ , then z(/ii|/i2) = 0. 

Theorem 2.67 Let H be a quasilocal specification, let v G Qinv{^), 'ind let fi G 
M+i^invi^)- Suppose that there exists a van Hove sequence (A„)„>i such that 

i™-^/A„(/ik) = 0. (2.113) 

Then ji G QtnvO^)- 

Theorem 2.66 is an immediate consequence of estimate (2.64) in Proposition 2.46(b). 
Note, again, that although $i,$2 are required to belong to the "small" Banach space 
B^^ the final estimate is in terms of the B^ j {J + Const) norm, hence much stronger. 

Theorem 2.67 is, on the other hand, a deep and surprising (at least to us) result: 
from a hypothesis on the behavior per unit volume in the infinite-volume limit one 
obtains a conclusion valid for every volume (namely //tta = /w). 

The combination of Theorems 2.66 and 2.67 will play a key role in the proof of the 
First Fundamental Theorem (see Section 3.2). 

Combining Theorems 2.66 and 2.67, we deduce the key result relating the DLR and 
variational approaches to classical lattice systems: 

Corollary 2.68 Let $ G S^ and let fi G Af_|_i^m^,(0). Then fi is a Gibbs measure for 
$ if and only if it is an equilibrium, measure for $. 
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2.6.7 Pathologies in Various Interaction Spaces Bh 

In Section 2.4.4 we introduced a large class of interaction spaces Bh^ of which the 
most important are S° and B^. Now we would like to discuss the physical differences 
between these spaces. This is an important issue, because we need to justify our view 
that (roughly speaking) B^ is the largest "physically reasonable" space of interactions. 

Our point of view is that the fundamental physical principles of infinite-volume 
equilibrium statistical mechanics are given by the theory of specifications and Gibbs 
measures. (We consider the variational theory of translation-invariant equilibrium mea- 
sures to be only a useful technical tool.) Furthermore, we argued in Section 2.3.3 that, 
at least for systems of bounded spins (including, in particular, all models with finite 
single-spin space), a physically reasonable specification must be quasilocal. If then we 
put aside hard-core interactions, it follows from Theorem 2.12 that a physically rea- 
sonable specification must be the Gibbsian specification for some absolutely summable 
interaction. Since B^ is the space of translation-invariant absolutely summable continu- 
ous interactions, this justifies our contention that B^ is the largest physically reasonable 
space of interactions. 

From a mathematical point of view, S° is the natural space of interactions on which 
to develop the variational theory of equilibrium measures. We nevertheless claim that 
S° is, from a physical point of view, much too large; even the variational theory on 
S° is "pathological". (This is connected with the fact that interactions in S° \ B^ do 
not in general define specifications, so there are no DLR equations. For this reason. 
Corollary 2.18 and Propositions 2.46 and 2.59 do not hold in general in S°, and the 
large-deviation theory does not apply to equilibrium measures which are not Gibbs 
measures.) To emphasize that S° is an unphysically large space of interactions, we list 
here some of the strange phenomena that can be proven for interactions in this space: 

1) There is a dense set of interactions in S° with uncountably many extremal 
equilibrium measures [206, Theorem V. 2. 2(c)]. (It is perhaps not surprising that highly 
frustrated interactions could produce uncountably many pure phases; but in S° this 
happens arbitrarily close to zero interaction^ i.e. at what ought to correspond to "high 
temperature".) 

2) For any finite family /ii, ...,//„ of ergodic translation-invariant measures of finite 
entropy density (relative to /i°), there exists an interaction in S° for which all of these 
measures are simultaneously equilibrium measures [206, Theorem V.2.2(a)].'*° (We 
find this result absolutely fiabbergasting: it implies, for example, that there exists 
an interaction in B^ for which the Gibbs measures of the infinite-temperature and 
zero-temperature Ising models are coexisting pure phases!) It follows that in B^ the 
interaction cannot be reconstructed uniquely from the equilibrium measure: for any 
given measure //, there are many different interactions in B^ having // as an equilibrium 
measure. This is in sharp contrast to Proposition 2.46, which asserts the uniqueness 



■*'^This result is reminiscent of the corresponding result in the theory of (non-quasilocal) specifica- 
tions: see the remark at the end of Section 2.3.4. 
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(modulo physical equivalence) of the interaction (if one exists at all) within B^. 

3) The pressure is nowhere Frechet-differentiable in S° [70]. By contrast, the pres- 
sure is Frechet differentiable of order n in a neighborhood of the origin ( "high temper- 
ature") in B"" {n > 2) [177, 228, 298]. ^^ 

Even the space B^ is incredibly large, in that it allows interactions which are strongly 
many-body (though not quite so strongly as in S°) and of arbitrarily long range (pro- 
vided only that they are absolutely summable). This means that even in B^ some 
rather strange phenomena occur: 

4) At low temperature, the Gibbs phase rule is generically violated in all of the 
spaces B^. This is because a first-order phase transition can be destroyed by an arbi- 
trarily weak (in £^ norm) but very long-range two-body interaction [70, 349, 332, 208]. 
The Gibbs phase rule can hold only in spaces Bh where the weight h[X) grows suffi- 
ciently fast with the diameter of X (and not merely its cardinality). 

5) The pressure is not analytic in any open set in any of the spaces B^ [89]; in 
particular, it is not analytic even at "high temperature" (i.e. a neighborhood of the 
origin). In fact, for spaces Bh in which h[X) depends only on the cardinality of X, the 
pressure is analytic in a neighborhood of the origin if and only if h{X) > const X e"^'^' 
for some e> [205, 89]. ^^ 

Remark. In the Ising model, analyticity does hold in B^ norm for the subspaces 
of B^ corresponding to interactions written in lattice-gas or spin form ($x = JxP^ 
or $x = Jx^^^ 1 respectively) [205]. This is a very surprising result, which we do not 
completely understand from a physical point of view. It is related to the fact that 



■*^It seems to be an open question whether the pressure is once Frechet differentiable in a neigh- 
borhood of the origin in B^ . The proofs of higher-order differentiability in [177, 228, 298] use the 
Dobrushin uniqueness theorem, which applies only in B'^ or higher. See also [157, Chapter 8 and the 
corresponding notes]. 

■*^This statement is a slight lie. What Dobrushin and Martirosyan [89] actually prove is the following: 
Let the single-spin space ilo be finite; let h{X) depend only on the cardinality of X, and not satisfy 
ti{X) > const X e'^l^l for any e > 0; and let B'^ be the complexification of Bh- Then, in every open 
set U C B'^ containing a real point, there exists a complex interaction $ G t/ and a sequence of cubes 

A„ /■ cxD such that the finite-volume partition functions 2'a„($) = / exp — -ff* free\ ^'"^ ^^^ zero. 
Thus, the finite- volume free energies have (complex) singularities arbitrarily close to every (real) point 
in Bh- This result makes it very unlikely that the infinite- volume pressure could be analytic in an open 
set oi Bh', but strictly speaking it does not rule it out, because conceivably the singularities present 
in finite volume could miraculously disappear in the passage to the infinite- volume limit. [Here is 
a simple example in one complex variable: Let Zn{z) = z — zq for all n, where zq G C \ R. Then 
lim„^oo ri~^ log Z„(z) = for all z G R (provided that the branch cut is chosen to avoid the real axis). 
And the function certainly does have an analytic continuation from R to Cl] We propose as an open 
problem to mathematical statistical mechanicians: prove that the infinite-volume pressure, which is 
well-defined on the space Bh of real interactions, has no analytic continuation to any open set U C Bf 
containing a real point. In any case, the result of Dobrushin and Martirosyan does show that the 
Dobrushin-Shlosman [93, 96] complete analyticity condition does not hold for any open neighborhood 
in Bh, for the specified class of h- 
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physically equivalent interactions can have widely differing norms in any given space 
Bh] in particular, for lattice-gas or spin interactions, one can have ||$||bi ^ II^IIbVJ" 
[35f]. 

In Section 4 we shall prove that certain renormalized measures are not Gibbsian for 
any interaction in B^. The fact that not even in B^ — a space large enough to support 
much peculiar behavior — does an interaction exist is an indication of how strong this 
result is. 



3 Position-Space Renormalization Transformations: 
Regularity Properties 

In this section we define our general framework for studying renormalization transfor- 
mations (RTs), and prove the two Fundamental Theorems on single- valuedness and 
continuity of the RT map. 

We consider only a single application of the RT map. Therefore, the semigroup 
property of the "renormalization (semi)group" plays no role for us. In particular, we 
need not assume that the image system is of the same type as the original system. 
Nevertheless, we shall occasionally (by abuse of language) use the term "RG map", for 
reasons of familiarity and brevity. 

3.1 Basic Set-Up 

3.1.1 Renormalization Transformation Acting on Measures 

We consider a "renormalization map" T from an original (or object) system (0 = 

Oq ,JF, /i°) to an image (or renorm.alized) system, [ft' = ft'^ ,jF',/i° ). The single-spin 
spaces Oo and Oq need not be the same; indeed, we will present an important example 
in which they are not the same (see Example 5 below, and Section 4.3.5). Although 
our theory in this section works only when the spatial dimensions d and d' are the 
same — see the discussion of Example 7 below, as well as Section 4.5.2 — we find 
it notationally convenient to keep the prime on all image-system quantities, as this 
makes it easy to see which quantity refers to which system. We assume the following 
properties for T: 

Tl) r is a probability kernel from (0, JF) to (0', JF'). 

T2) T carries translation-invariant measures on into translation-invariant measures 
on 0'. [That is, if // G Minv{fl), then fiT G Minv{fl').] 

T3) T is strictly local in position space, with asymptotic volume compression factor 
K < oo. More precisely, there exist van Hove sequences (A„) C Z and (A^) C Z 
such that: 
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(a) The behavior of the image spins in A^ depends only on the original spins in 



A„, i.e. 



For each A G JF^/ , the function r( • , A) is jFA^-measurable. (3.1) 

(b) hm sup J-ttJ < K. 

(Tl) allows the renormalization map to be either deterministic or stochastic. In 
the deterministic case, the configuration io' of the image system is a function io' = 
t{iLi) of the original configuration. The most conspicuous examples of these type of 
transformations are decimation, linear block-spin transformations, and majority rule 
for blocks with an odd number of spins (see Examples 1,2 and 5 below). For the 
general case of a stochastic transformation, given an original-system configuration cu, 
we choose an image-system configuration io' with a certain probability T^u^dio'). The 
special case of a deterministic map t: — )• 0' corresponds to setting T{lo, • ) to be 
the delta-measure S^^^ [i.e. the configuration u' = t[(jj) is chosen with probability 
1]. Fxamples of stochastic transformations are the majority-rule transformation for 
blocks with even number of spins, and more generally the Kadanoff transformation 
(see Fxamples 2, 3 and 4 below). The main point of (Tl) is to exclude transformations 
with negative weights, which have no sensible probabilistic interpretation.'*"^ 

(T2) is self-explanatory. Typically translations of the image system correspond to 
some subgroup of translations of the original system. That is, there typically exists a 
homomorphism R: Z — )• Z such that 

r(r^(.)cu, •) = r.r(cu, •) (3.2) 

for all x G Z and all uj G 0.'*'* For example, a RT employing b X b blocks will have 
R{x) = bx. Thus the translation group Z of the image system corresponds to the 
subgroup R[Z ] C Z of translations of the original system. Property (T2) trivially 
follows from this. Some examples are given below. 

Properties (Tl) and (T2) make rigorous the equation (1.1): the map fi i— )• fiT is 
a well-defined map from Ai+i^inv{^) into Ai+i^inv{^')- This justifies the claim made 
in the Introduction, that it is easy to define the RT map from measures to measures. 
The more difficult and subtle problem of defining the RT map from interactions to 
interactions will be discussed in Section 3.1.3. 

Property (T3) — the strict locality of the renormalization map — is crucial for our 
proofs of the First and Second Fundamental Theorems. Most often (although we shall 
not require this) the probability measure T{lo, ■ ) has a product structure 

T{uj,duj') = n T{uJB,,duj'J , (3.3) 



■^^Transformations with negative weights have occasionally been used in the physics literature, not 
necessarily intentionally: see e.g. [335]. See also the comments in [277, footnote on p. 453 and text on 
p. 496]. 

44ln more detail, T(Tr(^)W, A) = T{lj,T-^[A]) for all a; G Z'*' , w G fi and A G f. 
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where B^ is the finite set of original spins which together determine the image spin 
io'^. Now let us suppose that B^ = Bq -\- R[x) [i.e. Bq translated by i?(x)], where 
R: Z — )• Z is a homomorphism satisfying det i? y^ (obviously this needs d' = d). 
We then claim that (T3) holds with K = |deti?|. Proof: Let (A^) be any van Hove 
sequence in Z . What sets (A„) C Z should we take to satisfy (T3)? Clearly the 
image spins in A„ depend only on the original spins in the set A* = i?[Ay -\- Bq (Z~L . 
So at first one might think to take A„ = A* . The trouble is that the (A*) need not form 
a van Hove sequence, because they may have a nonzero density of "holes". [Consider, 
for example, decimation with spacing 6 > 1: here Bq = {0} and R{x) = hx.] So we 
take instead 

A„ = Z'^ n convex hull of A^ . (3.4) 

Then, using the fact that det i? ^ 0, it is not hard to convince oneself that (A„) is a 
van Hove sequence, and that 

lim P4 = |deti?| . (3.5) 

n^oo A' 
\ n\ 

Two points are relevant here: Firstly, we need det R ^ (and in particular d' = d) 
in order to guarantee that the sets (A„) are sufficiently "fat" to form a van Hove 
sequence (see the discussion of Example 7 below for what can happen if this does 
not hold)."*^ Secondly, the quantity K = limsup^^^^ |A„|/|A^| is by definition the 
asymptotic volume compression factor: as such, it is determined solely by R; it does 
not depend on the size of Bq as long as Bq is finite. 

We conjecture that the two Fundamental Theorems hold also for quasilocal renor- 
malization maps — i.e. maps in which io'^ depends sufficiently weakly on distant spins 
(jjy — but we are not able to prove this with our present methods. Quasilocal renor- 
malization maps are of great practical importance: for example, in "momentum-space" 
renormalization one often uses a deterministic transformation 



iO^ 



J2F{bx-y)uy (3.6) 



with some length rescaling factor b > 1 and some kernel F. In particular, if one uses a 
"soft" cutoff in momentum space [365, 31], then the kernel F is rapidly decreasing at 
infinity in x-space (e.g. decreasing faster than any inverse power of its argument). It 
is an important open problem to extend our results to such maps. 

3.1.2 Examples 

I) Decimation transformation [210, 364]. Let 0' = and d! = J, and let b be an 
integer > 2. Define the deterministic RT map 

cu^ = tobx ■ (3.7) 



■*® Actually, all we really need is that R, considered as a d x d' matrix, have rank d. Thus, we could 
allow some cases with d' > d. But these are of little interest. The interesting cases with d' ^ d have 
d' < d (Example 7 below), and these do not satisfy (T3). 
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This map is strictly local [in fact, of the product form (3.3)] with asymptotic volume 
compression factor K = b . It is of the form (3.2) with R{x) = bx. 

More generally, let 0' = and d' = d and let R be any homomorphism from Z to 
Z satisfying det R y^ 0. Define the deterministic RT map 

'^'x = ^R(x) ■ (3.8) 

This map is strictly local [in fact, of the product form (3.3)] with asymptotic volume 
compression factor K = \ det R\. Some examples are shown in Figure 2(a)-(b). 

2) Majority-rule transformation for the Ising model [276, 278, 277]. Let b be an 
integer > 1, let Bq be a fixed finite subset of Z (the block) ^ and let B^ = Bq -\- bx (i.e. 
Bq translated by bx). Define the map 






f +1 if T.yeB, <7y>^ 
-1 ifE,eB.^. <0 (3.9) 

where "±1" denotes a random choice with probabilities of 1/2 each. This transforma- 
tion is deterministic if b is odd, stochastic if b is even. 

3) Kadanoff transformation for the Ising model [210]. A large class of nonlinear 
RT maps for the Ising model = 0' = { — 1,1}^ can be represented in the following 
form: Consider the same blocks B^ as in the previous example, and let p > 0. Define 
the stochastic RT map 

exp p< E (7y 
T{ay) = n ^ / '""' { ■ (3.10) 

^eJ'*' 2 cosh ip E cFy) 
\ yeB^ ) 

This map is strictly local [and clearly of the product form (3.3)] with asymptotic volume 
compression factor K = b'^, and is of the form (3.2) with R{x) = bx. Many well-known 
RT maps are special cases of (3.10): 

(a) With Bo = {0} and 6=1, (3.10) is model I of Grifliths and Pearce [172, 173], 
a kind of "copying with noise". (This map also arises in applications to image 
processing [129, 152].) As p — )• oo it tends to the identity transformation. 

(b) With Bo = {0} and 6 > 2, (3.10) is model II of Grifliths and Pearce [172, 173], a 
kind of "decimation with noise". As p — )• oo it tends to the ordinary decimation 
transformation (3.7). 

(c) With Bo = {0, 1, . . . , 6 - l}'^ (a hypercube of side b) and b > 2, (3.10) is the 
Kadanoff transformation [210]. In the limit p — )• oo it tends to the majority-rule 
transformation (3.9). 
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Figure 2: Some examples of RT maps in dimension d = 2. 

(a) Decimation with 6 = 2 and A^ = 4. 

(b) Decimation with R[xi^X2) = {xi + a;2,Xi — X2) and A^ = 2 ("checkerboard deci- 
mation"). 

(c) Block transformation with b = 2, Bq = {(0, 0), (1,0), (0, 1), (1,1)} and K = 4. 

(d) Block transformation with R[xi^X2) = (2x1 — 3^2,2:1 + 2x2), Bq = 
{(0, 0), (±1, ±1)} and K = 5 [357]. 

(e) Block transformation with R[xi^X2) = (2x1 + 2:2, Xi + 2x2), i?o = 
{(0, 0), (1, 0), (1, 1)} and K = 3 [276, 278]. 
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yeB^ 
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yeB^ 



As in the decimation transformation, we can replace bx by a more general nonsin- 
gular liomomorpliism R[x). Then K = |deti?|. Some examples are shown in Figure 

2(cHe). 

4) Kadanoff transformation for the N -vector model. For the A^-vector model, in 
which the spins are unit vectors in R , the natural generalization of the majority-rule 
transformation is the "rescaled block-spin transformation" [321] 



(3.11) 



which is deterministic. (In principle one should specify what happens when Y^yeBj; ^y = 
0: for example, one could choose some particular value of cr^, or one could let a'^ be 
uniformly distributed on the unit sphere. But this situation occurs with probability 
zero, so it is irrelevant what choice one makes.) Similarly, the Kadanoff transformation 
has a natural generalization [190]: 

exp U<- E (^yj 
T{a,da') = n \ ^^^- ^ JO(a-), (3.12) 

\ yeB^ J 

where 

— -1 

ZN{h) ^ f e'^-dn{a) = r(|) (^y lK_A\h\) (3.13) 

and dfl denotes uniform measure on the unit sphere in R . As p — )• oo this tends to 
the deterministic map (3.11). 

Analogous formulae can be used to define a Kadanoff transformation for the q- 
state Potts model, using the representation of Potts spins as unit vectors in R'~ 
pointing from the center of a "hypertetrahedron" to its vertices. As p — )• oo this 
transformation tends to the "plurality-rule" transformation with random tie-breakers. 
The Potts model with vacancies [279, 305] can also be treated in this framework, by 
representing the "vacancy" state as the origin in R'~ . 

5) Linear block-spin transformations. A natural choice of a deterministic linear 
transformation is the averaging transformation 



ai 



'J2^y^ (3.14) 



for a suitably chosen reseating factor c. Typically we choose |i?o| ^ < c < 1. We observe 
that if c > |i?o|~^, this transformation does not map any model of bounded spins to 
itself: if Oo = [-M,M], we must take % = [-M',M'] with M' > \Bo\cM > M. As 
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a consequence, the fixed point (s) (if any) for sucli a transformation must correspond 
to model(s) of unbounded spins (i.e. Oq = R)- For tliis reason, it is most natural 
to consider (3.14) as acting, riglit from tlie start, on sucli a system of real-valued 
spins. However, in this paper we are not concerned with fixed points; our interest is in 
whether the first application of the RT map is well-defined. For this purpose we may 
work entirely with models of bounded spins, provided that we are willing to accept 
Oq ^ Oo- For example, in the two-dimensional Ising model with 2x2 blocks (and 
c = 1), we have Oq = {-1, 1} but 0[, = {-4, -2, 0, 2, 4}. 

For unbounded spins with values in R (or R ), one can use either a deterministic 
linear block-spin transformation [147] 



"Px 



yeB^ 



(3.15) 



or the stochastic linear block-spin transformation [27, 18 

1 / , 



r((^, dip') = Yi const X exp 



2e2 



Vx - c 



E 



"Py 



d^'x 



(3.16) 



which corresponds to adding Gaussian white noise of variance e^ to the deterministic 

block spins (3.15). In both cases, the rescaling factor c must be chosen appropriately 

if the transformation is to have a fixed point: e.g. for hypercubic blocks of side h one 

takes 

h~'^''^ to have a high-temperature fixed point 

h'"^ to have a low-temperature fixed point (3-17) 

5-('i+2-'))/2 ^Q have a critical fixed point 

This need to fix a parameter is characteristic of linear renormalization transformations. 
Linear block-spin transformations have attracted the attention of mathematical 
physicists because of their connections with central-limit theorems: see for example 
[200, 32, 147, 71, 58]. 

6) Linear block-spin transformation with large-field cutoff. Fven when the linear 
block-spin transformation (3.14) does not map models of bounded spins to themselves, 
one expects the corresponding fixed-point measure(s) to have rapidly decaying (e.g. 
Gaussian or faster) densities at large (p. Therefore, it may be a reasonable approxima- 
tion to modify (3.14) by cutting off the fields explicitly at \cr\ = Af, where M is some 
fixed large number. That is, on the space = 0'= [—M^MY one can consider the 
deterministic RT map 



^x 



cY^yeB^ Vy 

MsgniZyeB^'Py 



if c 
if c 



^yeBx 'Py 
^yeB:, 'Py 



<M 
> M 



(3.18) 



[This works also for A^-component spins, if one interprets sgn((^) = (^/|(^|.] To our 
knowledge, this transformation has not been considered previously. (But see Cam- 
marota [56] for a related idea.) 
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7) Restriction to a hyperplane [318]. Let Oq = Oq but take d' < J, and define 

u'^ = cu(^,o) (3.19) 

where denotes the origin in Z ~ . This is an unusual type of decimation transforma- 
tion in which the original model is restricted to a hyperplane; it has recently elicited 
some interest (see Section 4.5.2). However, this transformation does not satisfy our 
condition (T3): although the image spins in a volume A^ depend only on the original 
spins in A^ X {0} — so that naively one would have a volume compression factor K = 1 
— the trouble is that the sets A^ X {0} do not tend to infinity (in Z ) in van Hove 
sense when the A^ do so in Z . To make them tend to infinity in van Hove sense, one 
would have to "fatten them out", e.g. by taking A„ = A^ X Cr„ where Cr„ is a cube 
of side Rn in Z ~ , and i?„ — )• oo as n — )• oo. But then the volume compression factor 
K would be infinite. This example makes clear why we need d' = d. Indeed, in this 
example we have i[fi-T\fi^T) > [see Section 4.5.2], contrary to what would happen 
if (T3) were to hold [cf. (3.30)]. 

3.1.3 Renormalization Transformation Acting on Interactions 

We can now define precisely the renormalization map TZ acting on the space of inter- 
actions, making rigorous the diagram (1.2). As argued in Section 2.6.7, the largest 
"physically reasonable" space of interactions is B^^ the space of translation-invariant 
continuous absolutely summable interactions. Therefore, in defining 7^, we shall restrict 
attention to interactions $ G S^ such that there exists an image interaction $' G S^. 
Since a priori we wish to adopt a completely open-minded definition — allowing for 
the possibility of multi-valuedness — we must define 7^ as a relation rather than a 
function. 

Definition 3.1 Let T be an RT map satisfying properties (Tl) and (T2). We then 
define the corresponding map TZ = TZj to he the relation 

n = {($,$') eB^ X B^: there exists fi G G^nv{'n'^) such that jiT G G^nv{'n'^')} ■ 

(3.20) 
We can also think of TZ as a multi-valued function: we write $' G 7^($) as a synonym 
for ($, $') G TZ. We define the domain of TZ to he the set 

domT^ = {$: there exists $' with ($,$') G TZ] 

= {$: 7^($) ^0} . (3.21) 



A priori the map 7^ could be multi-valued. Indeed, the way we have defined it, 
it surely is multi-valued, because of physical equivalence: if $' G 7^($) and ^' G 
B^ n (j7 + Const) ^ then also $' -|- ^' G 7^($). The more interesting question is whether 
7^ can be multi-valued apart from, the "trivial" multi-valuedness caused by physical 
equivalence. The scenario proposed in [72] is precisely the claim that this can happen; 
we shall prove in our First Fundamental Theorem (Theorem 3.4) that in fact it cannot 
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happen. That is, we shall prove that the map TZ is single-valued modulo physical 
equivalence. We shall moreover prove that the phrase "there exists //" in (3.20) can be 
replaced equivalently by "for all //" . 

For RT maps satisfying a very mild continuity condition, we can say something 
about the closure properties of the multi-valued map TZ. To avoid bothersome topo- 
logical complexities, we restrict attention to compact metric single-spin spaces Oq- 

Theorem 3.2 Let Oq be a compact m.etric space, and assum.e that T satisfies (Tl) 
and (T2) and is Feller (i.e. Tf is continuous if f is). Then TZ is a closed .subset of 
B^ X B^ with respect to the B^ j {J -\- Const) X B^ j {J -\- Const) seminorm. 

Proof. Assume that ($„, $^) G Tl and ($00, <J>'oo) £ ^^ X S^ with 

y}^\\^n-^oo\Wl(J+Const) = }}^\\K ' "^'JIb^ /(J+Const) = 0. (3.22) 

We need to prove that ($00, <J>'oo) ^ '^• 

Choose, for each n, a translation-invariant Gibbs measure //„ for $„. By passing to 
a subsequence, we can assume without loss of generality that //„ converges weakly to 
some measure //oo! and since $„ — )• $00 in B^ j {J -\- Const) seminorm, it is easy to see 
that /ioo is a translation-invariant Gibbs measure for $00 • Now the Feller hypothesis 
on T guarantees that iJ^nT ^ fJooT weakly. Since //^T is a translation-invariant Gibbs 
measure for $^, and $^ — )• <^'^ in B^ l{J -\- Const) seminorm, it follows that jiooT is 
a translation-invariant Gibbs measure for ^'^. But this implies that ($oo,$'oo) £ TZ. 



3.1.4 A Remark on Systems of Unbounded Spins 

The results to be proven in Sections 3.2 and 3.3 are in principle applicable to systems 
of either bounded or unbounded spins. But for unbounded spins our results are not of 
much interest, because we restrict attention to bounded Hamiltonians (i.e. absolutely 
summable interactions). The trouble, as discussed at the end Section 2.4.4, is that 
we lack at present an adequate general theory of unbounded spin systems: we are 
unable to specify, for example, a space of interactions that includes all "reasonable" 
interactions. The development of such a general theory is an important open problem] 
it would be a first step towards putting the standard Wilson-style RG theory [365] on 
a rigorous footing. In particular, in such a framework one could try to prove analogues 
of our First and Second Fundamental Theorems. 

In this regard it should be remarked that the important work of Gawedzki and 
Kupiainen [147, 148, 150, 151] on rigorous RG theory does not implement exactly the 
standard Wilson prescription, at least for bosonic theories: while the small-field part 
of the Gibbs measure is represented by a Hamiltonian of the usual kind, the large-field 
part is represented instead by a polymer expansion [148]. (In recent work, Brydges 
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and Yau [53] systematize this idea, and formulate the RG purely in terms of a polymer 
expansion.) For fermionic theories, where there is no "large-field region", Gawedzki 
and Kupiainen [149] do implement the full Wilson prescription; however, fermionic 
theories have no direct probabilistic interpretation. Also, for bosons, Koch and Wittwer 
[218, 219] implement the Wilson prescription, but so far only in the hierarchical model. 

3.2 First Fundamental Theorem: Single- Valuedness of the 
RT Map 

Among the possible pathologies of the RT applied at the level of Hamiltonians, the 
following scenario has been proposed [72]'*^: Consider a Hamiltonian H lying on a first- 
order phase-transition surface, that is, one for which there exist at least two distinct 
pure phases (extremal translation-invariant Gibbs measures), call them /ii and fi2- 
Now perform a renormalization transformation T as indicated in (1.1). The resulting 
renormalized measures fi[ = fiiT and fi'2 = fi2T may then, it is claimed, be Gibbsian for 
two different renormalized Hamiltonians H[ ^ H^- In other words, the renormalization 
map TZ from Hamiltonians to Hamiltonians, defined by (1.2), may be multi-valued . 

Here we disprove such a scenario. We show that if two initial Gibbs measures 
correspond to the same interaction $, then the renormalized measures are either both 
Gibbsian for the same renormalized interaction $', or else they are both non-Gibbsian 
(in which case there is no renormalized interaction at all). 

This theorem follows from comparing the large-deviation properties of different 
Gibbs measures according to whether they belong to the same or different interactions. 
Heuristically, if // and v are two Gibbs measures corresponding to different interactions, 
then the probability of finding in ;/ a large droplet looking like a typical configuration 
for the measure // is exponentially small in the volume of the droplet: 

Prob^(cuA is typical for p) ~ e'^d^D . (3.23) 

On the other hand, if // and v correspond to the same interaction, this probability is 
sub-exponential: 

Prob^cuA is typical for //) ~ e'^d^D . (3.24) 

Mathematically, as seen in Section 2.6, this is expressed in the fact that the relative 
entropy density satisfies 

, , I . J > if /i and v are Gibbs measures for different interactions , . 

1=0 if /i and v are Gibbs measures for the same interaction 



■*®This scenario is stated very clearly in the Monte Carlo paper of Decker, Hasenfratz and Hasenfratz 
[72, p. 23, lines 2-5]. On the other hand, the analytic arguments in the companion paper of Hasenfratz 
and Hasenfratz [189] concern "singularities" whose precise nature is unspecified. We are unable to 
make a connection between the two lines of reasoning. 
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Now, under renormalization one looks only at the block spins and forgets about the 
internal spins, hence 

Probj, (block spins in cua are typical for //) 

> Probj,(a// spins in cua are typical for //) . (3.26) 

Therefore, if initially the probability was subexponential (same interaction), then under 
renormalization it remains so and we can never obtain the exponential decay (3.23) 
characteristic of different interactions. Mathematically, this is expressed by the fact 
that the relative entropy decreases under the application of arbitrary deterministic or 
stochastic transformations, in particular under the RT: 

Lemma 3.3 Let (0, E) and (0', E') be measurable spaces, and let T be a probability 
kernel from (0, E) to (0',E'). Then, if fi and v are probability measures on 0, 

I{}iT\vT) < I{^i\v). 

Proof. This is a well-known result, although it is rather difficult to find a complete 
proof in the literature. (Most of the published proofs concern one or another special 
case: T deterministic, vT = v, discrete state space, etc.) The first complete proof of 
which we are aware is due to Csiszar (f963) [67]; however, we would not be surprised 
to learn that this result was known much earlier. See also, for instance, [36f] and [6f , 
Theorem 8.f]; and see [64] for some stronger results. For the convenience of the reader, 
let us give a one-line proof: 

I{^\v) = I{jixT\pxT) > I{jiT\pT) . (3.27) 

Here the first equality is Proposition 2.53(h): the measures fi X T and v xT have the 
same regular conditional probability given E, namely T . [The intuitive idea is that the 
pair [fi X T^p X T) contains at least as much information as the pair (//, ;/), since the 
latter is the restriction of the former to the sub-cr-field E C E X E'; but it contains 
no more information, because the sam.e probability kernel has been used to generate 
both fi X T and v xT from // and v.] And the inequality is Proposition 2.53(g), since 
fiT (resp. vT) is the restriction of fi X T (resp. p X T) to the sub-cr-field E' C E X E'. 



Theorem 3.4 (First fundamental theorem) Let fi and v be translation-invariant 
Gibbs measures with respect to the same interaction $ G S^, and let T be an RT map 
satisfying properties (T1)-(T3). Then: 

(a) Either fiT and vT are both non-quasilocal (i.e. not consistent with any quasilocal 
specification) , or else there exists a quasilocal specification IT with which both fiT 
and vT are consistent. In the latter case, if the single-spin space is finite, then 
IT is the unique quasilocal specification with which either fiT or vT is consistent, 
and it is translation-invariant. 

86 



(b) Either fiT and vT are both non-Gibbsian (for absolutely summable interactions) , 
or else there exists an absolutely summable interaction $' for which both fiT and 
vT are Gibbs measures. In the latter case, if ^' is continuous [as it always is e.g. 
for a discrete single-spin space], $' is the unique continuous absolutely .summable 
interaction (modulo physical equivalence in the DLR sense) for which either fiT 
or vT is a Gibbs mea.sure. 



Proof. Let (A„) C Z and (A^) C Z be van Hove sequences having the properties 
(T3) assumed in Section 3.1. Now, by Theorem 2.66, the fact that // and v are Gibbs 
measures for the same interaction imphes that 

i™-^/A„(/ik)=0. (3.28) 

On the other hand, the image spins in A^ depend only on the original spins in A„: that 
is, (/uT) \J^'f^i is the image under T of fi t^A„, and likewise for p. Hence, by Lemma 3.3 
we have 

hdl^Tlj^T) < h^j^) . (3.29) 

It follows that 

< limsup— — /A/^(/ir|z/r) < lim — — /A„(/i|z/) 

= . (3.30) 

Therefore, by Theorem 2.67, if pT is consistent with a quasilocal specification H', then 
fiT must also be consistent with this sam.e specification H'. The same argument can 
be made interchanging the roles of fi and p. Thus, either fiT and pT are both non- 
quasilocal, or else there exists a quasilocal specification H' with which both fiT and 
pT are consistent. In the latter case, if the single-spin space is finite. Theorem 2.15 
guarantees the uniqueness of H'. In particular, since fiT and pT (being translation- 
invariant) are obviously consistent with any translate of H', we conclude that H' is 
translation-invariant . 

A special case of the foregoing is: if pT (resp. fiT) is Gibbsian with respect to an 
absolutely summable interaction $', then fiT (resp. pT) must also be Gibbsian with 
respect to this sam.e interaction $'. The uniqueness modulo physical equivalence of $' 
is then guaranteed by Corollary 2.18. ■ 

The First Fundamental Theorem shows that the RT map TZ is single-valued modulo 
physical equivalence. It also shows that the phrase "there exists //" in the definition 
(3.20) can be replaced equivalently by "for all //". 

Remarks. 1. The first step of this proof (using Theorem 2.66) does not require fi 
and p to be translation-invariant. But the second step (using Theorem 2.67) does seem 
to require at least fiT and pT to be translation-invariant. So we do not know whether 
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the hypothesis of translation-invariance of fi and p can be omitted in this theorem. 
(Note: We always assume that the interaction $ is translation-invariant.) 

2. In part (b), the interaction $', if it exists, ought to be physically equivalent in 
the DLR sense to a translation-invariant interaction. Unfortunately, we are not able 
to prove this. From the uniqueness we know that $' is physically equivalent to all of 
its translates; but it seems to be an open question whether this guarantees that $' 
is physically equivalent in the DLR sense to a translation-invariant interaction. An 
affirmative answer would also allow Kozlov's [222] Gibbs Representation Theorem to 
be given a satisfactory translation-invariant version (see the Remark at the end of 
Section 2.4.9). 



3.3 Second Fundamental Theorem: Continuity Properties of 
the RT Map 

A second aspect of the scenario proposed by Decker, Hasenfratz and Hasenfratz [72] 
is that the RT map may be discontinuous at an original Hamiltonian Hq lying on a 
first-order phase-transition surface: namely, for original Hamiltonians H arbitrarily 
close to Hq on opposite sides of the phase-transition surface, it is claimed that the 
corresponding renormalized Hamiltonians H' may be a finite distance apart. 

Here we disprove this scenario too. We show that the RT map is always continuous 
(in a suitable norm) on the set of Hamiltonians where it is well-defined^ that is, on the 
set of Hamiltonians for which the image measures are Gibbsian. 

The key idea underlying our proof is the fact that, if // is a Gibbs measure for an 
interaction $ G S^, then the DLR equations allow the reconstruction of the interaction 
$ (modulo physical equivalence) from the measure //: 

log ^ = - ^ TJ^ + const($) + o(|A|) (3.31) 

(see Section 2.4.8). Therefore, if /ii and /i2 are Gibbs measures for interactions $i, $2 £ 
B^^ we have 

log^ = -J2TJ^,_^, + const($i) - const($2) + o(|A|) (3.32) 

a/i2A 



xeA 



and in particular 



log^ ,,ow.... = |A|||$i-$2||.o/(^+a«...) + o(|A|). (3.33) 



dfj. 



2A 



B{n)/const 



Now the probability densities of renormalized measures are (particular) weighted av- 
erages of the original densities, so the supremum of the renormalized density cannot 
exceed that of the original density. That is, || log[dfii/dfi2)\\B(n)/const can only decrease 
under the RT: 
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Lemma 3.5 Let (0, E) and (0', E') be measurable spaces, and let T be a probability 
kernel from (0, E) to (0',E'). Let fi and v be probability measures on 0, with fi <^ p. 
Then fiT <^ vT and in fact 



d[fiT) 



log 



^°S d{.T) 
d[fiT) 



L'^(vT) 



d{vT) 



L°°(i/T)/ const 



< 



< 



log 
log 



dfi 
dp 

dfi 
dp 



L°°(i/) 



L°^ (i^)/ const 



(3.34) 
(3.35) 



Proof. Suppose that the Radon-Nikodym derivative (= density) dfi /dp satisfies 



dfi 
< a < — — < < +00 z/-a.e. 

dp 



(3.36) 



Then ap < fi < bp (in the sense of the usual ordering on positive measures), so 
obviously a[pT) < fiT < b[pT). It follows that 



djfiT) 
a < ,, < 6 \pi j-a.e. 



d{pT) 



Si 



mce 



dfi, 
dp 



L°°(i/) 



dfi 
dp 



L°^ (i^)/ const 



max(log 6, — log a) 
l(logfe-loga) 



(3.37) 

(3.38) 
(3.39) 



[where a and b are the sharpest values making (3.36) true], with an analogous formula 
for d[fiT)/d[pT)^ the lemma is proven. ■ 

Theorem 3.6 (Second fundamental theorem) LetT be an RT map satisfying prop- 
erties (T1)-(T3), and let $i, $2 G dom7^. Then, for all $; G 7^($l) and $'2 G 7^($2), 



l^i - ^'2\Wl(J+Const) < Al$i - $ 



2\\B°/{J+Const) 



(3.40) 



That is, on its domain the map TZ is Lipschitz continuous (with Lipschitz constant 
< K) in the B^ I {J + Const) norm. 



Proof. Let (A„) C Z and (A^) C Z be van Hove sequences having the properties 
(T3) assumed in Section 3.1. Let fii G Qinvi^'^^) and fi2 G Qinvi^'^^)- By the First 
Fundamental Theorem (Theorem 3.4) we have fiiT G Qinvi^'^^) and fi2T G Qinvi^'^^)- 
Now the image spins in A^ depend only on the original spins in A„: that is, (fiiT) [JF^; 
is the image under T of fii \J-'a„7 and likewise for fi2. Therefore, by Lemma 3.5 we have 



log 



d{fiiT)j 



d{fi2T)j 



B{n')/const 



< 



log 



d[fi 



1 An 



d{fi 



2 An 



B{n)/const 



(3.4i: 



89 



(Since the measures /i2 and fi2T are both Gibbsian, they give nonzero measure to every 
open set; and moreover the interactions $i, $2, $1 and $2 are all continuous. Therefore 
we can replace the essential sup norms by the true sup norms.) Then 



\K -"^21180 /(J+Const) = }}^iXn ^°g 

log 



1 

K 



d{fiiT) 



A' 



I A. 



d{jl2T)A: 



d[fi 



2)k„ 



B(Q')/const 
B{n)/const 



< lim 

= K\\^^ - ^^hyij+const) , (3.42) 

where we have twice used Proposition 2.46(b). ■ 

It is curious that although all the interactions $1, $2, $i and $2 are here required 
to belong to B^^ the Lipschitz estimate (3.40) is stated in S° norm. This is because S° 
(or more precisely its quotient by J7 or J7 + Const) is the natural norm for measuring 
bulk energy contributions, as discussed in Section 2.4.8. The restriction to B^ is needed 
solely to ensure that the boundary energy contributions are o(|A|), so as to avoid the 
pathologies discussed in Section 2.6.7. In any case, we would like to emphasize that 
all the B" norms are equivalent (up to a bounded factor) for interactions involving 
boundedly many spins at a time (e.g. two-spin interactions), even when they are of 
arbitrarily long range. The difference between the B" norms concerns how they treat 
interactions that are very strongly multi-body . 

Theorem 3.6 constrains very strongly the ways in which the RT map can blow up as 
$ approaches the boundary of its domain. Indeed, suppose that ($„)„>i is a sequence 
in domT^ C B^ that converges in B^ norm [or more generally, in B^ j {J + Const) 
seminorm] to $00 £ B^ . (We need not require convergence in B^ norm, nor need we 
require that $00 belong to B^ .) Next let ($^)ra>i be any choice of renormalized interac- 
tions [i.e. $^ G TZ{^n) C B^]] here the choice concerns the selection of representatives 
modulo physical equivalence. Then (3.40) guarantees that ($^) is a Cauchy sequence 
in the B^ j {J -\- Const) seminorm, hence converges in B^ j {J -\- Const) seminorm to 
some $'q^ G S°; moreover, this limit is unique modulo physical equivalence (i.e. modulo 
J -\- Const). Now, if $00 and ^'^ (or any interactions in their physical-equivalence 
classes) are both in B^ , then it follows from Theorem 3.2 that ($oo,$'oo) ^ ^7 hence 
$00 £ domT^. Therefore, if $00 (H B^ \ domT^, it must be that $'^ is not physically 
equivalent to any interaction in B^ . 

One would like to conclude from this that the B^ (semi)norms | |$^||bi/( j"+Corast) niust 
diverge as n — )• 00. Unfortunately, we are not quite able to prove this, because we have 
not been able to prove a version of Proposition 2.39(a) modulo physical equivalence 
(cf. Proposition 2.43). The best we have been able to prove is the following: 

Corollary 3.7 Let Oq be a compact metric space, and assume that T satisfies (Tl)- 
(T3) and is Feller. Let ($„)„>i be a sequence in domT^ C B^ that converges in B^ j {J -\- 
Const) seminorm to $00 (H B^ \ domT^. For each n, let $^ be any interaction in 
7^($„) (lB\ Then: 
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(a) Either ($^)ra>i fails to converge in B^ , or else ||$^||bi ~^ oo- 
// the single-spin space Oq is finite, then we also have: 
(h) For any h^ 1, \\^'J\Bh/{j+Const) ^ oo. 

Proof, (a) Suppose that $^ — )• ^'^ in S°, but ||$^||bi /> oo. So there is a subse- 
quence of ($^) on which the B^ norm is bounded, say by M; and Proposition 2.39(a) 
then imphes that ^'^ G B^ (with ||$'q^||bi < M). But by Theorem 3.2 this means that 
($00, $'oo) ^ ^7 contrary to the hypothesis that $00 ^ domT^. 

(b) As argued above, the equivalence classes [$y = $^ + J7 + Const converge in 
B^ I {J + Const) to some equivalence class [$'3^]. It follows that one can choose new 
representatives $^ G [^^ and ^'^ G [$'oo] such that $^ — )• $'q^ in S°. Now suppose 
that Wn\\Bnl(J+Const) = Wnh^/iJ+Const) 7^ OO- Then there is a subsequence of ($^) 
on which the Bhj {J + Const) seminorm is bounded, say by Af; and Proposition 2.43 
then implies that $'^ G Bh-\- J -\-Const (with ||$'oo 1 1 Bh/(j"+Corast) ^ M). But this means 

that there exists $'oo G S/, C S^ (with ||$'oo||b;, < M) such that $'oo G [^'J\ = [^'^]. 
And by Theorem 3.2 this means that ($00, $'oo) £ ^, contrary to the hypothesis that 
$00 ^ domT^. I 

Our inability to prove the divergence of the B^ seminorm is not as serious as it may 
seem: as will be discussed in Section 6.1.2, one probably wants anyway to formulate 
RG theory in a space Bh of short-range interactions, and for such a space our result 
(b) is sufficient (when Oq is finite). 

4 Provably Pathological Renormalization Trans- 
formations 

4.1 GrifRths-Pearce-Israel Pathologies I: Israel's Example 

4.1.1 Introduction 

Griffiths and Pearce [172, 173, 171] were the first to point out the possible existence 
of what they called "peculiarities" of the RT. These peculiarities were exhibited in 
models in which the internal spins undergo a phase transition for some fixed block-spin 
configuration. They observed that in such a situation the correlation functions of the 
internal-spin system could become discontinuous functions of the block spins, which 
implies that each of the terms of the (formal) expansion yielding the renormalized 
Hamiltonian (1.3) could be discontinuous. This casts doubts on the convergence of 
such an expansion, and hence on either the existence or the continuity properties of 
the renormalized Hamiltonian. 

This situation was further clarified by Israel [207] in the particular case of the 
6 = 2 decimation transformation. He argued that when such peculiarities exist, a 
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very weak locality condition is violated by the renormalized measure: the conditional 
expectation for a single site is a discontinuous function (in the product topology) of 
the boundary conditions. That is, it is possible to fix the block-spin configuration 
in an arbitrarily large volume around the origin in such a way that what happens 
at the origin depends strongly on the block spins which are outside of the volume. 
The set of configurations for which this pathology occurs is improbable, but not of 
zero measure. In our terminology, the renormalized measure is non-quasilocal: that is, 
it is not consistent with any quasilocal specification. In particular, the renormalized 
measure is not the Gibbs measure for any uniformly convergent interaction. 

In this section we fill in the technical details of Israel's argument, thereby converting 
it into a rigorous proof. In the following sections we shall generalize Israel's argument 
to other models and other renormalization transformations. In all cases, the underlying 
physical mechanism causing the non-Gibbsianness of the renormalized measure is the 
same: the infiuence from the block spins outside the specified volume is transmitted to 
the origin via the internal spins in the intermediate region, by-passing the block spins 
in the finite environment of the origin. This occurs because the internal spins have a 
phase transition, and the block-spin boundary conditions can pick different phases of 
these internal spins. 



4.1.2 Israel's Example: Decimation in d = 2 

Let us present now Israel's example — the two-dimensional Ising model at low temper- 
ature and zero magnetic field, using the b = 2 decimation transformation — together 
with the proof that after one renormalization step the renormalized measure is no 
longer Gibbsian.'*^ The strategy of the proof is to show that the renormalized measure 
exhibits grossly non-local correlations, in the sense that the conditional probability dis- 
tribution of the spin at the origin, as a function of all the other spins, depends strongly 
on the spins arbitrarily far away from the origin. More precisely, we shall show that if 
we take an arbitrarily large cube and fix all the block spins inside, except the origin, 
in a fully alternating configuration, then the renormalized magnetization at the origin 
depends strongly on the block-spin configuration outside of the cube. 

The ferromagnetic nearest-neighbor Ising model in Z is defined by the formal 
Hamiltonian 

H = -jJ2a,a,, (4.1) 

where (ij) denotes nearest-neighbor pairs and J plays the role of an inverse tempera- 
ture. We shall use the decimation transformation with scale factor b = 2. The image 
(or block) spins are those spins with both coordinates even, while the remaining spins 



■*^It is well known that the decimation transformation is badly behaved m the limit of infinitely 
many decimations [210, 364]: for example, any fixed point must have a two-point correlation function 
{(To'jiTx) which is independent of x (so in particular doesn't decay as \x\ -^ cxd). But the present 
example is much more drastic, as the problems appear after a single step. 
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are the internal spins. We denote by (Z y™"se (^j-esp. (Z )™*) the set of image (resp. in- 
ternal) spins. More generally, if A is a subset of Z^, we denote by A™"^^ = An(Z^)™"^^ 
(resp. A™* = A n (Z^)™*) the set of image (resp. internal) spins in A. 

The proof of non-quasilocality of the renormalized measure goes in four steps: 

Step 0. Computation of the conditional probabilities for the image system.. These 
conditional probabilities turn out to be related to expectation values in a system of 
internal spins, with fixed image spins io' . 

Step 1. Selection of an image-spin configuration ^' edai- ^^ ^nd an image-spin 
configuration io'^^^i such that the corresponding system of internal spins has a non- 
unique Gibbs measure (i.e. a first-order phase transition). 

Step 2. Study of a neighborhood of io'^^^i. We study the internal-spin system for 
image-spin configurations u' in a neighborhood of cu^ j^j, and show that the internal- 
spin order parameter is a discontinuous function of u' . In physical terms, this means 
that the internal-spin order parameter depends sensitively on the image-spin configura- 
tion arbitrarily far from the origin, if the image-spin configuration in the intermediate 
region is set to cu^pedai- 

Step 3. "Unfixing" of the spin at the origin. This is a technical step relating the 
image spin at the origin to the internal spins nearby. (After all, we want the conditional 
probabilities for image spins, not internal spins.) 

Let us now discuss these steps in detail: 

Step 0. Computation of the conditional probabilities for the image system.. Let fi 
be any Gibbs measure for the ferromagnetic Ising model in Z with nearest-neighbor 
coupling J. Our goal is to show that, for J sufficiently large, the image (decimated) 
measure fiT has non-quasilocal conditional probabilities. Therefore, our first order of 
business must be to compute these conditional probabilities. To do this, we use the 
only fact we know about the measure //, namely that it satisfies the DLR equations for 
the ferromagnetic nearest-neighbor Ising model. 

The present case is relatively simple, because the image spins are simply a subset 
of the original spins. Let, therefore. A' be a finite subset of Z ; we wish to compute 
the conditional probabilities £'^x(/|{Cj}jeA'=) for functions / of the spins {(t'}j£A'- But 
these are just the conditional probabilities E^[f\{cri}i^2(A"')) for functions / of the spins 
{cfc}fce2A'- There is a slight complication now, because the set 2(A'^) = (2^)"""^^ \ 2A' 
is not the complement of a finite set; its complement consists of the image spins in 
2 A' plus all the internal spins. Therefore, the DLR equations for the original model 
do not immediately tell us how to condition on {cr;};£2(A'=)- However, we have studied 
this problem in Section 2.3.7; the conclusion (Proposition 2.25) is that the conditional 
probability measure //( • |{c/};g2(A'=)) is, for /i-almost-every {cr;};£2(A'=)7 ^ Gibbs measure 
for the Ising model restricted to volume (2A') U (Z )™* with external spins set to 
{(T;};g2(A'=)- This latter system is specified by the same formal Hamiltonian (and hence 
the same interaction) as the original Ising model, except that now only the spins in 
(2A') U (Z )™* are considered to be random variables, and the spins {cr;};£2(A'=) are 
considered to be fixed. 

Note that we know only that //( • |{c/};g2(A'=)) is som.e Gibbs measure for the re- 
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stricted interaction: if the restricted interaction happens to have more than one Gibbs 
measure, then we have no way of knowing which one is //( • |{c/}/g2(A'=))- Therefore, 
we shall have to prove bounds which are valid uniformly for all Gibbs measures of the 
restricted interaction. This is what we shall do in Steps 2 and 3 below. 

Note also that this computation of the conditional probabilities is asserted to be 
valid only for fi-almost-every {cr;};£2(A'=); indeed, conditional probabilities are only 
defined up to modifications on a set of measure zero. Therefore, in order to prove non- 
quasilocality we must prove not only that this particular version of the conditional 
probabilities is a discontinuous function, but that no function obtained from this one 
by modification on a set of //-measure zero can be continuous. That is, we must prove 
that the conditional probabilities are essentially discontinuous. We shall do this in 
Steps 2 and 3 below. 

It is convenient to study first the system of internal spins alone, i.e. the system 
in (Z )™* with all image spins {cr'Aj^i^ set to fixed values. We call this system the 
modified object system for image-spin configuration {c^'A j^j2. In Step 3 below we will 
"unfix" the image spins in the volume A'. In fact, it suffices to consider just one 
particular volume A', which we shall take to be {0}. 

Step 1. Selection of an image-spin configuration io'^^^i. Our goal is to show that 
the conditional probabilities //( • |{c/};g2(A'=)) are essentially discontinuous functions of 
{(T;};g2(A'=)- Therefore, we must find a point u' = {(j'A ^^i^ of essential discontinuity. A 
good candidate would be an image-spin configuration cu^ j^j such that the correspond- 
ing system of internal spins has a non-unique Gibbs measure. Indeed, non-uniqueness 
of the Gibbs measure means that the internal spins in volume 2A' depend sensitively 
on the internal spins arbitrarily far from the volume 2A' (albeit with the intermediate 
internal spins free to fiuctuate); so it is a reasonable guess that the Gibbs measure 
might depend sensitively also on the image spins arbitrarily far away (but with the 
intermediate image spins held fixed at ^' edai)? ^"^^ ^^^^ i^ precisely the statement of 
essential discontinuity (see Step 2 below). 

For the 6 = 2 decimation transformation, such a configuration io'^^^i was found by 
Griffiths and Pearce [173, 171]: it is the fully alternating configuration lo'^h defined by 

<,, ^ <J2na^. = {-ir^'^ (4.2) 

[see Figure 3(a)]. Notice that each internal spin is adjacent either to two image spins of 
opposite sign — in which case the effective magnetic fields cancel — or else to no image 
spin. Therefore, the modified object system is simply a ferromagnetic Ising model in 
zero field on a decorated lattice [342], as shown in Figure 3(b). Now we can explicitly 
integrate out the spins in the decorated lattice that have exactly two neighbors, yielding 
an effective coupling J' = | log cosh 2 J between those two neighbors. The result is an 
ordinary ferromagnetic Ising model on Z , with nearest-neighbor coupling J' and zero 
magnetic field [Figure 3(c)]. If J' > J, = |log(l + ^2) = 0.440686..., that is, 
J > I cosh" (1 -|- v2) = 0.764285. . . ~ 1.73Jc, then the modified object system for 
image-spin configuration co'^n has two distinct Gibbs measures, a "-|-" phase and a — 
phase (obtainable by using "-|-" or — boundary conditions, respectively). 
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Figure 3: (a) The fully alternating image-spin configuration io'^n- (b) The decorated 
system of internal spins, (c) The equivalent nearest-neighbor interaction on (2Z)^. 
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Step 2. Study of a neighborhood of u'^ -^^^ = io'^n- The next step is to study image- 
spin configurations in a neighborhood (in the product topology) oi lo'^h. To show that 
the order parameter (crj)^^/ is an essentially discontinuous function of the image-spin 
configuration cu', it suffices to show that there exists a constant (5 > such that in each 
neighborhood of lo'^h the essential oscillation of (crj)^^/ is at least 8. More precisely, it 
suffices to show that there exists (5 > such that in each neighborhood Af 3 u'^^ there 
exist nonempty open sets Af+^Af- C Af and constants c_|_ > c_ with c_|_ — c_ > (5 such 
that 

(ci)^)' ^ c_|_ whenever u' G A/+ (4.3a) 

(ci)^)' ^ c_ whenever u' G Af- (4.3b) 

Now a basis for the neighborhoods Af 3 io'^^ is given by sets of the form 

AIr = {io': Lo' = Lo'^i-f. on A^^, u' = arbitrary outside A^^} , (4-4) 

where A^^ is a square of side 2i? -|- 1 centered at the origin (here R is an unprimed 
distance). We shall take Af+^Af- to be sets of the form 

AfR,R',+ = {io': id' = Lo'^n on h.R, Lo' = -\-l on A^ri \ h.R, Lo' = arbitrary outside A^^/} 

(4.5a) 

ATr^ri- = {io': Lo' = Lo'^n on A^^, u' = —1 on A^ri \ A^^, u' = arbitrary outside A^^/} 

(4.5b) 

with R' chosen appropriately as a function of R [R < R' < oo). The motivation 
behind this choice is that setting the image spins in Ari \ Ar to be all -|- (resp. all — ) 
is expected to push the system into its -|- (resp. — ) phase. The remainder of Step 2 is 
devoted to proving that this is in fact the case. 

Since we know only that the conditional distribution ( • )^^i is some Gibbs measure 
for the modified object system, we need to prove the bounds (4.3) uniformly for all 
Gibbs measures for this system. It suffices to show that there exists R" < oo such 
that the Gibbs measure for the modified object system in the finite volume A^h, with 
image spins io' G A/r^ri^^ (or A/r^r'^-) and arbitrary internal-spin boundary condition 
{o';};g(22)jntw»"t , satisfies the bounds (4.3). For simplicity we shall take R" = R' . 
In fact, in this two-dimensional example (but not in higher dimensions) we can take 
R' = R -\- 2. Therefore, we are led to the following situation: 

Let Ar be the square of side 2R -\- 1 centered at the origin, and let Tr = A^^ \ A^^-i 
be the R^^ layer. Now choose an even number i?, and consider all the configurations 
with the image spins in A^^ fixed in the alternating configuration cu^/j, and those in the 
second layer outside A^^ (that is, in T'r^2'^) fixed to be "-|-". The spins outside Ar^2 
[both image and internal) are fixed in some arbitrary configuration. The situation is 
depicted in Figure 4(a), where a circle represents an internal spin which fiuctuates over 
all possible values. Now consider all the resulting systems of internal spins in A^|2 given 
the above configuration of image spins in A^"2^^ and an arbitrary fixed configuration 
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(of both image and internal spins) outside A7^_|_2. We want to convince the reader that 
all the measures so obtained have local magnetizations {(Ji^^i^) for (zi, 22) G ^r+i which 
are bounded below by a strictly positive constant, uniformly in R (sufficiently large) 
and uniformly in the boundary condition outside A7^_|_2. The sequence of bounds used 
in our proof is summarized in Figure 4. 

Each internal spin in A^|2 feels an "effective magnetic field" it J from each image 
spin adjacent to it; but because the image-spin configuration in A^^ is alternating, these 
"effective magnetic fields" are all zero except at some sites in layers r^^+i and Tr^2- 

(i) An internal spin in layer r^^+i feels an effective field +2 J if it is adjacent to two 
"+" image spins. 

(ii) An internal spin in layer Tr^2 feels an effective field -\-3J or -\-J depending on 
whether the adjacent spin in layer Tr^3 (which is always an internal spin) happens 
to be "+" or "-". 

We therefore consider the system of internal spins in A^|2 with the magnetic fields 
described in (i) and (ii) above [Figure 4(b)]. 

Next we notice that by the FKG inequality [f26] (or alternatively the Griffiths II 
inequality [341]), the local magnetizations {(Ji^^i^) for (zi, 22) G ^r+2 ^^^ bounded below 
by the values that they would take if the magnetic fields +2 J in (i) were changed to 
zero, and the fields -\-3J in (ii) changed to -\-J. We now have a system consisting of 
the spins in A^|2 7 with a magnetic field -\-J on each spin in layer r^|2 [Figure 4(c)]. 

This latter system lives on a finite subset of the decorated lattice. We can explicitly 
integrate out the spins in A^|;^ that have exactly two neighbors (namely, the spins 
that have one coordinate even and one coordinate odd), yielding an effective coupling 
J' = I log cosh 2 J between those two neighbors. Similarly, we can integrate out the 
spins in r^|2 7 yielding an effective magnetic field h' = | log cosh 2 J > on each 
remaining spin in r^|;^, except that the field is 2h' at the corners [Figure 4(d)]. But this 
last system is equivalent to a square lattice of size [R-\-2) X [R-\-2) with nearest-neighbor 
coupling J' and -|- boundary conditions [Figure 4(e)]. As i? — )• 00 this system tends to 
the "-|-" phase for an Ising model with coupling J'. In particular, the magnetization 
(cTjj^j^) of any spin remaining in this system (i.e. any spin with Zi and Z2 both odd) 
tends to the spontaneous magnetization Afo(J'), which is > if J' > Jc- We can now 
return to the decorated lattice, to compute the magnetization {(Ji^^i^) on the internal 
spins that got integrated out (i.e. the ones with Zi even and Z2 odd or vice versa): 

{^^U^2)A';ll, = {tanh{J{(T' + (t")))r+2 

= {{}Unh2J){a' + a"))R+2 
— > (tanh2J)Mo(J') > (4.6) 

where a' and a" are the two internal spins adjacent to (Ji^^i^. We have therefore proven 
our claim that the magnetizations {(Ji^^i^) for (21,22) G ^r+i ^^^ bounded below by a 
strictly positive constant [namely (I — e)(tanh2J) Afo(J') for any e > 0], uniformly in 
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Figure 4: Sequence of bounds proving a lower bound on the magnetization for the 
internal spins. 
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Figure 5: Decorated lattice when the spin at the origin is free to fluctuate. 



R > i?o(^i,^2) and uniformly in the boundary condition outside A7^_|_2. Repeating the 
argument but with the image spins in Tr^2 chosen as "—"^48 ^^ obtain the "— " phase 
for the internal spins and thus a strictly negative upper bound on {(Ji^^i^). This proves 
that the local magnetization, say for the four internal spins neighboring the origin, is 
determined by the image spins at faraway distances. 

Step 3. Unfixing of the spin at the origin. We now have to make a slight modifl- 
cation in the preceding argument, as the system we really want to study is the system 
consisting of the internal spins in h.Rj^2 'ind the spin at the origin, with the image spins 
in A^"^^ other than the one at the origin flxed in the alternating conflguration io'^n, 
the image spins in layer V^^,^^'^ set to be "+", and the spins outside h.Rj^2 (both image 
and internal) flxed in some arbitrary conflguration. By the same reasoning as before, 
we obtain a system on the decorated lattice plus the origin, with a coupling J between 
the origin and its four neighbors and an additional magnetic field —J on the neighbors 
of the origin [Figure 5]. Denoting by ( • )_|_ (resp. ( • )^) the expectation in the old (resp. 



■*®Because we have fixed the image spin at the origin to be +, the two situations are not quite 
symmetric. But the only change is a shift in the location of the internal spins in V r^i which feel a 
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new) decorated system, it is easy to see that 
(cro,o)7 = 



E(7o,o=±i (^0,0 exp [J(cro,o - l)(cro,i + ctq-i + cri,o + cr_i,o)])_^ 



E<7o,o=±i (exp [J{crofi - l)(cro,i + ctq -1 + CTifi + cr_i,o)])_^ 
1 - (exp [-2J(cro,i + ctq -1 + ai^ + cr_i,o)])_^ 



(4.7) 



1 + (exp [-2J(cro,i + ctq -1 + cti^q + cr_i,o)])_^ 
Similarly, for the analogous system with the image spins in V^^,^^'^ set to "— ", we have 

1 - (exp [-2J(cro,i + ctq -1 + cti^q + cr_i,o)])_ 



(^o,o)^ 



1 + (exp [-2J(cro,i + ctq -1 + cti^q + cr_i,o)])_ 
1 - (exp [+2J(cro,i + ctq -1 + cti^q + cr_i,o)])_^ 
1 + (exp [+2J(cro,i + ctq -1 + cti^q + cr_i,o)]) , 



(4.8) 



Therefore, 



where 



<-'°>^ - <-'»>- = (Trirb • '"> 



X = (exp [-2J(cro,i + ctq -1 + CTi^o + cr-i,o)])+ (4-10) 

y = (exp [+2J(cro,i + CTo-i + 0-1,0 + cr_i,o)])_^ (4-11) 



N 



ow 



y-x = 2(sinh2J(cro,i + CTo-i + CTi^o + cr-i,o)) + 

= 2 ^ ^-^{{(70,1 + (70,-1 + (Jifl + a_iflf) + 

k = l 
k odd 

> 4J ((cro,i + CTo-i + 0-1,0 + cr_i,o)) + 

= 16J(ao,i)+, (4.12) 

since the contributions from k = 3,5,... are all nonnegative by Griffiths' first inequality. 
On the other hand, the denominator in (4.9) is bounded between 1 and (1 + e*"^)^. 
Since we proved previously that for J' > Jc, the local magnetization (o-o,i)+ is bounded 
below by a strictly positive constant, uniformly in R (sufficiently large) and in the 
configuration outside A7^_|_2, we can conclude that 

(^o,o); - (^o,o): > <5 > (4.13) 

uniformly in R (sufficiently large) and in the configuration outside h.Rj^2- 



nonzero effective fiefd; and this is irrefevant, since we repface these fiefds by zero anyway. 
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Conclusion of the argument. In summary, we have shown that at zero magnetic 
field and any sufliciently low temperature, given any Gibbs measure //, the renormalized 
measure fiT has the following property: Let co'^n be the fully alternating configuration 
cr'-^ -^ = ( — l)'^"'"'^; let Ar^^ = J\ffi/2,{R/2)+i,+ be the set of all configurations {a'} that 
are alternating in Aji/27 "+" in r(_R/2)+i and arbitrary outside; and let Ar^^ be analo- 
gously defined but with "— " in T(^ji/2)+i- In the product topology Ar^^ and Ar^^ are 
open sets — in particular, they have strictly positive (/ir)-measure — and given any 
neighborhood of J\f 3 lo'^h we always choose R large enough so that Ar^j^ U Ar^^ C Af . 
Moreover, we have proven that for all lo[ G Ar^j^ and u^ G Ar^^, we have 

E^,T{cr'ofl\{cr[^^,^](^^,^^)i.(ofi)){to'^) - ^'a^t (cro,ol{c^'i,»-2}(»i.»'2)^(o,o)) (^2) > "^ > . 

(4-14) 
This means — as first pointed out by Israel [207] — that the conditional expectations 
of cTq Q are discontinuous as a function of the boundary conditions. More precisely, 
they are essentially discontinuous: no modification on a set of (/ir)-measure zero can 
make them continuous at io'^if Now, for systems with a finite single-spin space (such 
as the Ising model), continuity is equivalent to quasilocality. Therefore, what we have 
really proven is that the renormalized measure fiT is not consistent with any quasilocal 
specification. (In our language, fiT is non- quasilocal; in the terminology of Sullivan 
[336], it is non-almo.st Markovian.) In particular, fiT is not Gibbsian for any uniformly 
convergent interaction. 

Theorem 4.1 Let fi be any Gibbs mea.sure for the two-dimen.sional Ising model with 
nearest-neighbor coupling J > | cosh" (1 -|- v2) = 0.764285... ~ 1.73Jc and zero 
magnetic field. Let T be the decimation tran.sformation with .spacing 6 = 2. Then the 
mea.sure fiT is not consistent with any quasilocal specification. In particular, it is not 
the Gibbs mea.sure for any uniformly convergent interaction. 

In physical terms, we have shown that the value of the renormalized spin at the 
origin, ctqq, depends strongly on the values of the renormalized spins arbitrarily far 
from the origin, if the renormalized spins in the intermediate region are fixed to be 
alternating. Such a long-range dependence is incompatible with the measure fiT being 
Gibbsian for any reasonable interaction. 

4.2 GrifRths-Pearce-Israel Pathologies II: General Method 

In this section we abstract the essential features of the Griffiths-Pearce-Israel argument, 
in order to prepare the way for generalizations to more complicated examples. 

Step 0. Computation of the conditional probabilities. 

This step is technical and messy, but the final result is the obvious one [cf. (4.22)/(4.23) 
below]. The reader is therefore invited to skip this step on a first reading. 

For decimation, the computation of the conditional probabilities of fiT was an 
immediate application of Proposition 2.25. For more general RT maps, it will be a 
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more complicated application of this same proposition: the idea is to consider first a 
joint system of interacting spins io and cu', and then decimate this system to the space 
ft'. 

If fi is any measure on the system of original spins (i.e. on 0), and T is any prob- 
ability kernel from to 0', then the joint measure fi X T on ft X ft' is well-defined 
by 

(/ixr)(A)= [diJ.{io) [T{io,dio')xA{io,io') (4.15) 



for measurable sets A C ft X ft'. Now, if 11 = {tta} is a specification for the system 
of original spins, we wish to define a specification II (/) T = {(Yl (/) T)a,a'} for the joint 
system, with the property that 

fi consistent with 11 =^ fi X T consistent with II (/) T . (4.16) 

(In fact, the converse should also hold, i.e. a measure p on ft X ft' should be consistent 
with n (x) r if and only if fi = p IJ-" is consistent with 11 and p = fi X T .) 

For simplicity let us assume that the probability kernel T has the following form: 

T{uj,duj') = Yl f^{ujB,,oj'Jdp^{uj'J (4.17) 

xec 

where the p^ are probability measures, and the B^ are finite sets of original spins which 
together determine the image spin io'^. We also assume that the family of sets {Bx}xejC' 
is locally finite'*^, i.e. only finitely many image spins x depend on any given original 
spin y. Now, to motivate the construction, suppose that // is a Gibbs measure for an 
interaction $ (and a priori measure /i°). Then, formally the measure fi X T is given 
by 

(/i X T){d(jj,d(jj') "=" const x 

Yl e-^xH Yl uuB^.u'j n ^/^°K) n dM^'x) ■ 

xcc x^c xec xec 

(4.18) 

Of course, the first two infinite products (the ones over functions e"*-^^'^) and T^) 
are meaningless, but we know what to do: to describe the conditional probability 
distribution fi X T^ with io fixed outside a finite set A and io' fixed outside a finite set 
A', we retain in the products only those terms that intersect A and/or A', i.e. 

(fi X T){duj/^,duj'j^,\uj/^c^uj'j^,c) = const (c<;ac,c<;^/c) x 



n e"'"^^^ n Ucos^,co'JY[dp'A^x)lld'^^ 



iO^ 



xnA^0 x-.xeA'oT B:,nA^0oThoth ^eA ^e^' 

(4.19) 



■*^That is, the set {x: B^ 3 y} is finite for each y Q C 
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Now, the first product is just e"^*^'^*''^*''^ and tlie first and tliird products togetlier 
yield (wlien properly normalized) the kernel tta (cuac , Jcua ) • Therefore, the specification 
n (x) r should be defined as 

{]l(g)T){dujATduj'j^,\(jjAc,(jj'f^,c) = Za,a'(^a=,^a'0~"^ >< 

7rA(cuAc, Jcua) n r^((^A<= X uja)b,,{'^'a'c X'^'a')x) n ^^^i'^'x) ■, 

x:xek' or B^nki^Q) or hoth ^6^' 

(4.20) 
where 

^A,A'(^A=,^A'0 = 

/ 7rA(cuAc, Jcua) n r^((cuAc X cua)b,,(cua/c X cua/):J) n ^'^^(^^^ 

fi^Xfi', a;: a;eA' or _BxnA^0 or both ^£^' 

(4.2i: 

and we have assumed, of course, that Za,a'(^a=,^a'=) ^ 0- [I^ -^a.A'I^a^j^a'c) = O5 then 
(cuac , cu^;c ) is a "forbidden boundary condition", which has to be dealt with as in the 
theory of lattice systems with hard-core constraints [299, 313].] We must now check 
that: 

(a) n (x) r, thus defined, is indeed a specification. 

(b) If /i is any measure consistent with H, then // X T is consistent with 11 (x) T. 

These two verifications are messy calculations, which the authors are convinced will 
work out (although mental exhaustion prevented them from writing out the full details). 
Things become much simpler when 11 is the Gibbsian specification for an interaction 
$ and a priori measure /i°, and the T^^ are all nonvanishing. Then it is easy to see that 
n (x) r is the Gibbsian specification for the interaction $ (on the lattice C U C) defined 
by 

r$x(^)^ ifX' = 

$x,x'(tu,cu') = I -logr4cuB^,cu^) if X = 5, and X' = {x} (4.22) 

V otherwise 

and a priori measure /i° X v. (In particular, it follows immediately from the general 
theory in Section 2.3.2 that 11 (x) T is indeed a specification.) This is the interaction 
corresponding to the formal Hamiltonian 

xcc xec 

= Ho„gtnal{^) - ^\0gT^ii0B,,i0'^) . (4.23) 

xec 
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If the Tx can vanish, then $ may take the value +00, which is not (strictly speaking) 
permitted in our formulation; but the same algebra shows that 11 (x) T is indeed a 
specification, at least when Za,a'(^a=,^a'=) ^ 0.^° 

Having constructed the specification 11 (x) T on the lattice C U £', we can now apply 
the same argument as in the decimation case, based on Proposition 2.25 (see Section 
4.1.2, Step 0). Indeed, the renormalized measure fiT is obtained by decimating the 
joint measure fi X T^ i.e. restricting it to the lattice C . 

We hope that someone will come along and simplify our "abstract nonsense" con- 
cerning Step 0. But we have no doubt that our concrete arguments in this paper are 
correct. 

Step 1. Selection of an image-spin configuration io'^-^^i for which the corresponding 
internal-spin system has a non-unique Gibbs measure. 

We need to find an image-spin configuration io'^^^i such that the resulting system 
of internal spins (the "modified object system") has at least two distinct Gibbs mea- 
sures, call them /i_|_ and //_. How we do this depends on the details of the model and 
the renormalization transformation. For the 6 = 2 decimation transformation on the 
nearest-neighbor Ising model, the fully alternating configuration lo'^h does the trick. 
For the majority-rule transformation we shall need a more complicated configuration 
(Section 4.3.4). 

Now let / be a local observable such that //+(/) > //_(/); we shall call / the 
"internal-spin order parameter". (For the decimation example, / is the spin at a 
neighbor of the origin.) 

Step 2. Discontinuity of the internal-spin order parameter as a function of the image- 
spin configuration in a neighborhood of u'^^^i. 

The next step is to study the behavior of the internal-spin system for image-spin 
configurations in a neighborhood (in the product topology) of io'^-^^i. Our goal is to 
show that the order parameter for the internal-spin system is essentially discontinuous 
as a function of the image-spin configuration u' . 

To do this, we first choose image-spin configurations cu^ and lo'_ which we hope will 
"select the phases /i_|_ and //_". We then study image-spin configurations u' which are 
equal to cu^ j^j on some large box A^^, which are equal to cu^ [or lo'_] on some annulus 
A_R' \ h.R {R < R' < 00), and which are arbitrary outside Ari. Our goal is to show 
that, no matter how large R is, the internal-spin phase is selected by the behavior of 
the image spins in Ari \ Ar — for a suitable choice of R' depending on i? — no matter 
what happens outside Ari. 



^'^Many of our concrete examples do have configurations for which Z\\i(u)\c,u)'jy,c) = 0: for ex- 
ample, in the case of decimation, one obviously cannot insist that a certain image spin be -|-1 and 
simultaneously insist that the corresponding original spin be —1! But each of these concrete cases has 
a simple resolution: for example, in the case of decimation, we called internal spm only those original 
spins which are not (locked to) image spins; of course, the original spins which are locked to image 
spins don't even need to be considered. 
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In mathematical terms, our goal is to show that there exists a number 6 > such 
that in each neighborhood J\f 3 ^Leciai {^^ ^^^ product topology) there exist nonempty 
open sets AZ+jA/"- C JV and numbers c_|_,c_ with c_|_ — c_ > 6 such that for every 
id' G A/+ [resp. id' G A/"-] and every Gibbs measure fi for the internal-spin system with 
image spins set to cu', we have //(/) > c_|_ [resp. //(/) < c_]. Now a basis for the 
neighborhoods J\f 3 io'^^ is given by sets of the form 

Mr = {io': Lo' = Lo'^i-f. on A^^, u' = arbitrary outside A^^} , (4.24) 

We shall take Af+^Af- to be sets of the form 

Afn^Ri^jf. = {io': Lo' = Lo'^i-f. on A^^, u' = cu^ on A^ri \ A^^, u' = arbitrary outside A^^/} 

(4.25a) 

Mr^ri- = {io': Lo' = Lo'^n on A^^, u' = lo'_ on A^ri \ Ar, lo' = arbitrary outside A^^/} 

(4.25b) 

We then have to prove that R' can be chosen as a function of R [R < R' < oo) so that 
//(/) satisfies the claimed bounds. 

In practice, the only way we shall be able to prove the existence of such an i?' < oo 
is to prove that the internal-spin system with R' = oo and io' G AfR^co,+ or JVr^co,- has 
a unique Gibbs measure, and that this measure satisfies the required bounds. It will 
then follow fairly easily that the (possibly non-unique) Gibbs measures for R' < oo 
tend to this unique limit as i?' — )• oo, and satisfy the bounds (with a slightly reduced 
6) for some sufficiently large R' . 

We emphasize that since we know only that the conditional distribution ( • )^^i is 
some Gibbs measure for the internal-spin system, we need to prove the claimed bounds 
on /«(/) uniformly for all Gibbs measures for this system. To do this, it suffices to 
show that the bounds are satisfied for a finite-volum.e internal-spin system, for some 
sufficiently large volume, uniformly in the (internal-spin) boundary conditions; that 
is, it suffices to show that there exists R" < oo such that the Gibbs measure for the 
internal-spin system in the volume A^h, with image spins io' G A/i?,_R',+ (or A/i?,_R',_) 
and arbitrary internal-spin boundary condition {o';};g(22)'"nA'"'5 satisfies the claimed 
bounds. For simplicity we shall take R" = R' . 

Let us emphasize once again that both the image spins and the internal spins 
are arbitrary outside A^^/, but for different reasons. The image spins are arbitrary 
outside Ari because our computation of the conditional probabilities //( • \lo') is valid 
only for (/ir)-almost-every cu'; therefore, to prove that these conditional probabilities 
are essentially discontinuous (i.e. cannot be made continuous by modification on a 
set of (/ir)-measure zero), we must prove our bounds for a nonempty open set of 
configurations u' . The internal spins are arbitrary outside Aru (= Ari) because we 
know only that the conditional measure //( • \lo') is some Gibbs measure for the modified 
object system, but we have no idea which one (in case it is non-unique); therefore, we 
must prove bounds valid for all infinite- volume Gibbs measures of the modified object 
system. 
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Step 3. Unfixing of the spin at the origin. 

The final step is to sliow tliat if tlie system of internal spins is slightly modified 
by changing the interaction with a few (in the our examples just one) image spins 
close to the origin, the order parameter at these extra spins differs little from the value 
at internal spins close to the origin. This is the step of "unfixing" some image spins 
discussed above. 

Conclusion of the argument. 

Combining the conclusions of Steps 2 and 3, we have that for all possible image-spin 
configurations outside A^^/, the order parameter at image spins close to the origin is 
determined by the image spins in the arbitrarily faraway annulus A^ri \ A^r. In mathe- 
matical terms, the conditional probability distribution of the image spin at the origin 
is an essentially discontinuous function of the other image spins, in a neighborhood of 
'^'specific- Thus, the renormalized measure has non-quasilocal conditional probabilities: 
it is not consistent with any quasilocal specification, and in particular is not the Gibbs 
measure of any uniformly convergent interaction. 

4.3 GrifRths-Pearce-Israel Pathologies III: Some Further Ex- 
amples 

In this section we apply the Griffiths-Pearce-Israel method to prove non-Gibbsianness 
of the renormalized measure in the following additional examples: 

• b = 2 decimation for the Ising model in dimension d > 3. 

• Decimation with spacing 6 > 3, for the Ising model in any dimension d > 2. 

• The Kadanoff transformation with finite p and arbitrary block size 6 > 1, for the 
Ising model in any dimension d > 2. 

• Some cases of the majority-rule transformation for the Ising model in dimension 
d = 2. 

• Block-averaging, with even block size 6, for the Ising model in any dimension 
d>2. 

Finally, and most strikingly, we can show that in all of these examples except (and 
this probably only for technical reasons) the majority-rule case, there is in fact an 
open region in the (J, h)-plane for which the renormalized measures are non-Gibbsian. 
Therefore, the Griffiths-Pearce-Israel pathologies are not associated with the fact that 
the original model is sitting on a phase-transition surface. Rather, it suffices that a 
first-order phase transition can be induced in the internal-spin system by choosing an 
appropriate block-spin configuration. For this we need to work at low temperature but 
not necessarily at zero magnetic field. 
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4.3.1 Israel's Example in Dimension d > 3 

In this section we study the 6 = 2 decimation transformation for the Ising model in 
dimension d > 3. 

Step 0. Computation of the conditional probabilities. This has already been done. 

Step 1. Choice ofio'^-^^i. As in the two-dimensional case, we choose io'^-^^i to be the 
fully alternating configuration io'^n- The system of internal spins for a fully alternating 
image-spin configuration again corresponds to a periodically diluted ferromagnet: an 
internal spin with all but one of its coordinates even — that is, one which is adjacent 
to two image spins — has two less neighbors coupled to itself, while all other internal 
spins are unaffected. The only difference from the two-dimensional case is that the 
resulting lattice is not merely a decorated version of an exactly soluble Ising model, 
so we cannot write an explicit formula for its critical temperature. Nevertheless, it is 
easy to show that the internal-spin system does have a phase transition, and that at 
low enough temperature there exist distinct Gibbs measures /i_|_ and //_ with strictly 
positive and strictly negative magnetization, respectively; these phases can be selected 
by using, for example, "-|-" or "— " boundary conditions. These claims follow easily 
from a Peierls argument (for a description of such arguments, see e.g. [169]). They 
can alternatively be proven by observing that the diluted system is a collection of 
[d — I)-dimensional diluted and undiluted Ising models, ferromagnetically coupled. In 
particular, the J-dimensional diluted system is more ferromagnetic than the [d — I)- 
dimensional undiluted Ising model, and hence [169] exhibits spontaneous magnetization 
for all temperatures below the critical temperature Jc,d-i of the [d — I)-dimensional 
undiluted Ising model. 

Step 2. Study of a neighborhood of u'^^^i = io'^n- Next we must find image-spin 
configurations cu^ and (jj'_ that will "select" the phases /i_|_ and //_ of the internal-spin 
system. The choice is obvious: as in the two-dimensional case, we take cu^ (resp. (jj'_) to 
be the configuration with all spins -|- (resp. all spins — ). We need then to show that if 
the image spins in A^"^^ are fixed in a fully alternating configuration, and those in an 
annulus A^"^^ \ y\™"s^ are set to all -|- (or all — ), then for R' large enough (depending 
on R) the image spins in the annulus are capable of determining the internal-spin 
phase. 

In two dimensions we were able to take R' = R-\-2. That is, we were able to shield 
off a volume by fixing around it a single layer of image spins: namely, by setting the 
image spins only in layer Tr^2 to be -|-, we were able to guarantee that the effective 
magnetic fields felt by the internal spins in A^|2 ^^^ ^-U nonnegative, even if all the 
spins (both image and internal) outside A7^_|_2 are set to be — [see Figures 4(a)-(b)]. 
This situation does not, however, persist in higher dimensions: a layer r^"2^^ of -|- 
image spins does not protect all of the internal spins in r^|2 from the possible — spins 
in layer Tr^3 [see Figure 6]. Therefore, we have to resort to a more general argument 
to show that there exists a shielding layer, though thicker. Consider, therefore, the 
system of internal spins in volume A^/, with the image spins in A^"^^ fixed in the 
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'-R+3 

(worst case) 




Figure 6: Why a single layer does not work in J > 3. For the "worst" configuration of 
the next external layer (image and internal spins all "— "), some of the internal spins 
in layer r^|2 pick up a negative magnetic field. 
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alternating (it) configuration, the image spins in the annulus A^"^^ \ y\™"s^ fixed to 
be all +, and the spins outside Ari (both image and internal) fixed in an arbitrary 
configuration ct. We denote expectations in this system by ( • )±\-^- We want to show 
that, for J sufficiently large, there exists c > such that for all i? > there exists 
R' > R (depending on R) such that 



and by symmetry 



(^.)K^ > (^.)±;j;- > c> (4.26) 



(^.)±;-'^ < (^.)±;-;+ < -c < , (4.27) 



for every configuration a outside Ari and every i G A^*. This will be proven using 
correlation inequalities together with the uniqueness of the Gibbs measure for the 
internal-spin system with image spins set to all + or all — . 

More precisely, the proof of (4.26) will involve a sequence of inequalities comparing 
the following systems of internal spins: 

• The system of internal spins in volume A^/ described above, which we denote by 
(R, R' \ 

• The infinite-volume system of internal spins A™* = (Z )™*, with the image spins 
in A^"^^ fixed in the alternating (it) configuration and the image spins outside 

Ar fixed to be all +. We denote this system by ' . 

• The infinite-volume system of internal spins A™* = (Z )™*, with the image spins 
everywhere fixed in the alternating (it) configuration. We denote this system by 
/oo\_/i?, oo\ 

We shall prove the following: 



i?, oo 

±, + 



Step 2.1) /W-|-'_|_5r converges as i?' — )• oo to a Gibbs measure for the system 

Step 2.2) The system ' has a unique Gibbs measure, call it /W±^+ . 

Step 2.3) The measure /U±'+ is larger (in FKG sense) than all Gibbs measures for the 

Step 2.4) Let //^ be the + phase (i.e. the maximal Gibbs measure in FKG sense) for 
the system . Then //^ (c'i) > c > 0. 
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From these results we will then deduce (4.26). 



Step 2.1. The limit i?' — )• oo. We wish to consider the limit as i?' — )• oo of the measures 
/i_i_'_l_- (for fixed R). By compactness, this sequence of measures has at least one limit 
point in the weak topology (in fact, any subsequence has a limit point). We claim that 
:th 



/i_i_'_l_- (for fixed R). By compactness, this sequence of measures has at least one limit 

ict , a 
any limit point of the measures /U_|_'_|_ -^ (with arbitrary If) is necessarily a Gibbs measure 



for the system I ' . The proof is trivial: for any volume A C A/^/.i, the DLR 

equations for the systems ' _ and ' are identical (i.e. the tta's 

are the same); so for large enough i?', the measure /W-|-'_|_^ satisfies the DLR equation 
in volume A also for the system ' . Since the latter system's specification is 

Feller, the DLR equations are preserved under a weak limit. 

Note that we have not yet proven that the limit as i?' — )• oo exists; different con- 
vergent subsequences might a priori have different limits. But in the next step we will 

prove that the Gibbs measure for the system ' is unique., so in fact the limit 

doe.s exist. 

Step 2.2. Unique Gibbs measure for the system ' . Consider the infinite- 



volume system of internal spins (Z )™*, with the image spins in A^"^^ fixed in the 
alternating (it) configuration, and the image spins outside A^^ fixed to be all +. We 
claim that this system has a unique Gibbs measure. (We only need uniqueness at low 
enough temperature, but in fact the Gibbs measure is unique at all temperatures.) This 
uniqueness is intuitively obvious: the effective magnetic fields induced by the + image 
spins outside A^^ are sufficient to push the system into the + phase. Unfortunately, 
the proof we have to offer is a bit too complicated for our taste. It goes as follows. 
First, we notice that it is enough to prove uniqueness of the Gibbs measure when 
all the image spins (including those inside A^^) are set in the "+" position. Indeed, 
changing the image spins inside A^^ amounts to a finite-volume perturbation of the 
system and hence it does not alter the number of Gibbs measures [157, section 7.4]. 
[In fact, every Gibbs measure //' for the perturbed interaction comes from a uniquely 
defined Gibbs measure fi of the unperturbed interaction: if W is the perturbation, then 



/(•) = M-e-^)Me-^).] 

To prove the uniqueness of the Gibbs measure for the system with all image spins 
"-|-", we provide two arguments. First argument, proving uniqueness only at low 
temperature: Pirogov-Sinai theory [328, 260, 322] implies that the phase diagram at 
low enough temperature is a small deformation of that at zero temperature, but in 
this case there is only one ground state (namely, all spins "-|-"). Second argument, 
proving uniqueness at all temperatures: The internal-spin system is an Ising model on 
a periodic lattice, with nearest-neighbor coupling J > and a periodic magnetic field 
hx = hS^'^' (here 6^'^' = I if x neighbors an image spin, and otherwise), specialized 
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to h = +J. By the Lee- Yang theorem ([160, Section 4.5] or [250] and references cited 
therein) and a result of Lebowitz and Penrose [242] (see also [169, Theorem 4.4]), it 
can be shown that the pressure of such an Ising model is a jointly analytic function 
of J and h on the domain J, /i > 0. It follows that all periodic Gibbs measures give 
the same mean value to the observables conjugate to J and h: these observables are, 
respectively, Y^^i^j where the sum runs over all nearest-neighbor pairs (ij) in a unit 
cell of the periodic lattice, and Y, ^k where the sum runs over all sites k in this unit 
cell that are nearest neighbor to an image spin. By Griffiths' comparison inequality, it 
follows that 

ji-{cr,crj) = ji+{cr,crj) (4.28a) 

/i_(afc) = i^+{(Jk) (4.28b) 

for every pair (ij) of nearest neighbors and for every site k neighboring an image spin; 
here /i_|_ and //_ are the measures corresponding to "-|-" and "— " boundary conditions, 
respectively. We then resort to the inequality [236] 



/i+(a^)-/i_(a^) > /i+(a^)/i_(a^a^)-/i_(a^)/i+(a^a^) (4.29) 

valid for any sets A, i? C Z (we denote a^ = YlieA'^i)- From (4.28) and (4.29) we 
conclude that 

/._(a^) = /.+ (a^) (4.30) 

whenever a^ is a product of functions of the form aiaj with i,j nearest neighbors 
and (Tk with k being a neighbor to a image-spin site. (In other words, A must be the 
symmetric difference of a family of such sets {z,j} and/or {k}.) But it is not hard to 
see that all sets A C (Z )™* are of this form, hence 

fi- = fi+ . (4-31) 

Now by the FKG inequality //_ < /O < /«+ in FKG sense^^ for every Gibbs measure 
/?, hence there is a unique Gibbs measure at all temperatures. (This argument is 
essentially due to Lebowitz [236, 237], with minor alterations to accommodate periodic 
systems.) 

Step 2.3. Comparison to the system. We claim that the measure /U±^+ is larger 

(in FKG sense) than all Gibbs measures for the system . This is an immediate 

consequence of the FKG inequality combined with the uniqueness proved in Step 2.2. 
Indeed, by the FKG inequality, the finite-volume Gibbs measure for the i' , 



^"'^We write cr < cr' in case Ui < a'^ for all sites i. An observable / is said to be mcreasmg if 
/((t) < /(c') whenever a < a' . We say that /j, < i> in FKG sense in case /u(/) < i^(f) for all increasing 
local observables /. 
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system with any (internal-spin) boundary condition is larger in FKG sense than the 
finite- volume Gibbs measure for the system with the 

This inequality passes directly to the infinite- volume limit. 



finite- volume Gibbs measure for the system with the same boundary conditions. 



Step 2.4- Spontaneous magnetization for the -\- phase of the system. The 

system is precisely the Ising model on a periodically diluted lattice. As discussed in 
Step 1, this model has spontaneous magnetization for J sufficiently large. 

Step 3. Unfixing of the spin at the origin. Finally, we can "unfix" the spin at the 
origin in the same way as in the 2-dimensional example. 

Conclusion of the argument. We conclude that in every neighborhood of lo'^h there 
are open sets Af+^Af- such that 

i^.TKI{^:Wo)(cui) - i?,TKI{^:Wo)(cu2) > .5 > (4.32) 

for LOi G A/+ and LO2 G Af-. As in the 2-dimensional case, this implies the non- 
quasilocality of the renormalized measure //T, for any original Gibbs measure //. This 
works for any temperature below the critical temperature of the undiluted [d — 1)- 
dimensional Ising model. We have therefore proven: 

Theorem 4.2 Let d > 2. Then for all J > Jc^d-i, the following holds: Let fi be any 
Gibbs mea.sure for the d-dimen.sional Lsing model with nearest-neighbor coupling J and 
zero magnetic field. Let T be the decimation tran.sformation with .spacing 6 = 2. Then 
the mea.sure fiT is not consistent with any quasilocal specification. In particular, it is 
not the Gibbs mea.sure for any uniformly convergent interaction. 

4.3.2 Decimation with Spacing 6 > 3 

The conclusions of Theorem 4.2 for decimation with spacing 6 = 2 hold also for larger 
spacings. The main difference from the 6 = 2 case is that for 6 > 3 the system of 
internal spins obtained with u' = lo'^h is no longer simply a periodically diluted Ising 
model in zero magnetic field; rather, it contains a periodic alternating magnetic field 
which is nonzero at the sites neighboring an image spin. As a consequence, we need a 
more sophisticated technique to conclude that there is indeed a phase transition (Step 
I). The appropriate tool for this purpose is Pirogov-Sinai theory [328, 329], which 
is summarized in Appendix B. The upshot of P-S theory is that the phase diagram 
of a lattice system at low temperature can in some cases be deduced from the phase 
diagram at zero temperature. More precisely, if there are a finite number of periodic 
ground states, and these ground states satisfy a suitable "Peierls condition", then the 
phase diagram of periodic Gibbs measures at low temperature is a small perturbation 
of the phase diagram of ground states. In the case at hand, one can show that for 
the fully alternating block-spin configuration, the system of internal spins has only 
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two periodic ground states — namely, the one with all internal spins +, and the one 

with all internal spins and that these ground states satisfy the Peierls condition. 

It follows from P-S theory that at low temperature there are precisely two periodic 
Gibbs measures, /i_|_ and //_, characterized respectively by a strictly positive or strictly 
negative magnetization. The details of this part of the argument are presented in 
the Appendix B (Section B.5.3). Steps 2 and 3 are then proven in a manner exactly 
identical to the b = 2 case. The analysis of Section B.5.3 yields a (very weak) estimate 
of the range of temperatures for which the pathologies are present [formula (B.79)]. 
We have thus proven the following: 

Theorem 4.3 Let d > 2 and b > 2. Then for all J sufficiently large (depending 
on d and b), the following holds: Let fi be any Gibbs measure for the d-dimensional 
Lsing model with nearest-neighbor coupling J and zero magnetic field. Let T be the 
decimation transformation with spacing b. Then the measure fiT is not consistent with 
any quasilocal specification. In particular, it is not the Gibbs mea.sure for any uniformly 
convergent interaction. 

Remark. Checkerboard decimation, as shown in Figure 2(b), is a very different 
situation: the internal spins are not connected, and hence they cannot cooperate to 
have a phase transition. In fact, in this case the first iteration of the transformation 
is well-defined [364] [346, p. 193]. However, the second iteration of this transformation 
corresponds to a single iteration of the 6 = 2 decimation transformation, and so is 
ill-defined at low enough temperature. 

4.3.3 KadanofF Transformation with p Finite 

In some sense the results thus far should not be surprising: the decimation transforma- 
tion, unlike other RG transformations, does not in any sense integrate out the "high- 
momentum modes" and leave the "low-momentum modes" ; it merely integrates out one 
sublattice and leaves another. In particular, if the sublattice of internal (integrated- 
out) spins is connected , it is hardly surprising that the internal-spin system can exhibit 
a phase transition, and that this can give rise to RG pathologies. 

In this section we show something considerably more surprising: that the same 
pathology — non-Gibbsianness after one renormalization step — is present at low 
temperature for the Kadanoff transformation with any finite (but nonzero) p.^^ This 
result is in clear confiict with the RG ideology, which states that integration over 
high-momentum modes cannot produce singularities. (Indeed, our proof makes no 
distinction between block sizes b > 2 and b = 1 — and for b = 1 one is not integrating 
over any "modes", high-momentum or otherwise!) In the next subsection we shall prove 
a similar result for some majority-rule transformations (i.e. Kadanoff with p = oo). 



®^In earlier versions of this work [352, 353], we claimed this result only for small p. Subsequently 
we found a proof valid for all < p < co, which we present here. 



113 



Consider the KadanofF transformation (3.10) with block size b and parameter p. 
From (3.10) one readily concludes [55] that for each choice of block spins a' the condi- 
tional probabilities of the internal spins a correspond to a Hamiltonian 

Hesicr) = -JJ2'^''^3 ~ PJl'^'x Yl cTj + ^ log 2 cosh fp ^ crA . (4.33) 

This is the original Ising-model Hamiltonian perturbed by a block-dependent magnetic 
field and an antiferromagnetic multi-spin coupling. To obtain non-trivial results we 
consider blocks at least of size 2 in each coordinate direction. It is natural to expect 
that, for any fixed p < oo, for sufficiently large J (i.e. low enough temperature) the 
perturbation become effectively small, and the phase diagram a small deformation to 
that of the original Ising model. 

We notice, however, that there is a small difference with the original perturbative 
setting in that the last two terms in (4-33) do not include a temperature factor . In the 
study of deformations of phase diagrams, one considers a fixed value of /3 multiplying all 
the terms of the Hamiltonian, and analyzes the consequences of changing (perturbing) 
some of the remaining parameters. The proof that the deformations are smooth usually 
requires that the size of this perturbation not exceed a certain /3-dependent bound. In 
our case, after pulling out a common factor /3, the parameters of the perturbation 
acquire a /3-dependence and one is confronted with the problem of verifying that this 
/3-dependent size is smaller than the /3-dependent bound. This problem is especially 
serious in the case of the last term in (4.33), which does not have any small parameter 
preceding it, so that its size decreases only as 1//3. We conclude that to successfully 
complete Step 1 we need a slight strengthening of the usual PS theory, involving /am«7«es 
of interactions, and showing that the deformations of the phase diagram are small 
uniformly in members of this family. Such a strengthening is discussed in Appendix B 
(Corollaries B.25 and B.29). 

For Step 1, then, we choose a configuration io'^^^i for the block spins so that the 
middle term in the RHS of (4.33) does not favor any overall internal spin orientation 
— for example, a fully alternating configuration. The "uniform" version of PS theory 
implies (Appendix B.5.4) that at low enough temperature there are two coexisting 
phases /i_|_ and //_. This is the end of Step 1 of the Griffiths-Pearce-Israel argument. 
Steps 2 and 3 are then completed almost identically to the previous examples. 

In this way we conclude: 

Theorem 4.4 Let d > 2, b > 1 and < p < oo. Then there exists a Jq (depending on 
d, b and p) such that for all J > Jq the following holds: Let fi be any Gibbs measure 
for the d-dimensional Lsing model with nearest-neighbor coupling J and zero magnetic 
field. Let T be the Kadanoff tran.sformation with parameter p and block .size b. Then 
the renormalized mea.sure fiT is not consistent with any quasilocal .specification. In 
particular, it is not the Gibbs mea.sure for any uniformly convergent interaction. 

A (poor) estimate of the smallness of the temperature is given in formula (B.87). We 
emphasize that our estimate Jq is nonuniform in p. As a result, we are not able to 
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take p — )• oo at any fixed J, and thereby treat the majority-rule map. (Our partial 
results on the majority-rule map, obtained by a different method, will be described in 
the next subsection.) 

Remarks. 1. The occurrence of "peculiarities" in the Kadanoff transformation at 
small p and low temperature was suggested already by Griffiths and Pearce [i72, i73]. 

2. Interesting applications of the Kadanoff transformation (with block size b = I!) 
arise in image processing [152, 154, 63, 158, 129], speech recognition [302] and other 
fields of applied probability theory. The basic theoretical construct in these fields is a 
class of models termed hidden Markov models [302, 153, 248]; in our language these 
are simply the images of Markovian (i.e. nearest-neighbor) spin models under local 
renormalization transformations. It has been long recognized that such measures can 
be very far from Markovian; here we have shown that they can even be non-Gibbsian. 

Consider, for example, an Ising-model Gibbs measure corrupted by white noise: 
with probability e a spin is observed incorrectly, independently at each site. This is 
model I of Griffiths and Pearce [172, 173], and is equivalent to the Kadanoff trans- 
formation with p = tanh~ (1 — 2e) on blocks of size 6=1. For any e > 0, we have 
proven that the image measure is non-Gibbsian for (J, h) in an (e-dependent) open 
neighborhood of the low-temperature zero-field region. This system is of interest in 
applications to image processing [152, 129]. 

4.3.4 Majority-Rule Transformation 

Next we wish to show that Griffiths-Pearce-Israel pathologies occur also for the majority- 
rule transformation (i.e. the Kadanoff transformation with p = oo). For simplicity let 
us consider the case of an odd block size 6, so as to avoid the complications caused by 
ties. In view of the foregoing examples, it is natural to try a fully alternating block- 
spin configuration io'^n- Using Pirogov- Sinai theory, one might hope to prove that at 
low temperature the internal-spin system has precisely two extremal periodic Gibbs 
measures: a "-|-" phase in which the internal spins show an overwhelming majority of 
-|- spins in blocks where the block spin is -|- but only a weak majority of — spins where 
the block spin is — , and a "— " phase with the reverse behavior. This result would 
in fact follow if one could show that there are precisely two periodic ground states: a 
"-|-"-like state in which the internal spins are unanimously -|- in blocks where the block 
spin is -|- and show a bare — majority where the block spin is — , and a "—"-like state 
with the reverse behavior [Figure 7(a)]. Unfortunately, neither the shape nor the posi- 
tion within a block of these "minimal islands" of minority spins is in general uniquely 
determined [Figure 7(b)]; therefore, this family of states is infinitely degenerate, and 
we cannot apply P-S theory (at least in its usual form). Moreover, it turns out that 
these configurations are not even ground states: there are "strip-like" states of lower 
energy density [Figure 7(c)]. We believe that these strip-like states are truly ground 
states (though we have not proven it); and since they too are infinitely degenerate, P-S 
theory cannot be applied. 
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Figure 7: (a) The hoped-for structure of the "-|-"-hke ground state, (b) Indeterminacy 
of the shape and position of the minimal islands of — spins, for the case of a 3 X 3 block. 
The energy per island is 20J, irrespective of its shape. The energy density is lOJ per 
block, (c) Strip-like states with an energy density of 8J per block. These states also 
have an indeterminacy in each block. 
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We suspect that for each odd 6 > 3 there do exist block-spin configurations (more 
comphcated than io'^n) for which the Griffiths-Pearce-Israel argument can be carried 
through, but for 6 = 3,5 we have been unable to find any. The simplest case in which we 
managed to avoid these problems is 6 = 7. Here a bare majority in a 7 X 7 block consists 
of 25 spins, and the unique minimal-energy configuration for an island of 25 or more 
spins is a 5 X 5 square. By taking a doubly-alternating block-spin configuration [Figure 
8(a)], we can force these 5x5 squares to be positioned in a unique minimal-energy 
way [Figure 8(b)]. The energy of this arrangement is 80 J per group of eight blocks, or 
lOJ per block. On the other hand, strip-like states would cost at least 14J per block. 
Therefore, with this block-spin configuration the internal-spin system has precisely 
two ground states: the "-|-"-like state depicted in Figure 8(b), and the reverse "—"-like 
state. It then follows from P-S theory that at low enough temperature the internal- 
spin system has two extremal periodic Gibbs measures, /i_|_ and //_, characterized by 
a nonzero position-dependent magnetization of opposite signs. The ingredients of the 
rigorous argument showing that indeed the "-|-"- and "—"-like configurations of the 
type of Figure 8(b) are the only ground states, and that PS theory is applicable, are 
summarized in Section B.5.5. This completes Step 1, which is the hard part of the 
proof. 

The proof of Step 2 relies again on P-S theory and the FKG inequality. Step 2.1 
is proven in the usual way. The fact that the system with -|- block magnetization 
outside a square A^^ has a unique Gibbs measure is a consequence of P-S theory: at 
zero temperature this system has a unique ground state, namely the state with all 
spins -|-, and P-S implies (see Section B.5.2) that this trivial phase diagram persists at 
low enough temperature. This proves Step 2.2. Finally, we claim that the system with 
-|- block spins outside a square A^^ (and doubly alternating block spins inside) has a 
larger magnetization than the system with doubly alternating block spins everywhere. 
Indeed, the constraint that the majority of internal spins in a block B he — (resp. 
-|-) can be imposed by including in the Hamiltonian a term —hBSgn[Y,ieB^i) with 
hs — ^ — oo (resp. hs — ^ +oo). Since sgn[J2ieB^i) i^ ^^ increasing function of the 
spins, the FKG inequality implies that the magnetization at any site is an increasing 
function of hs- This proves Step 2.3. 

Step 3 is proven in the usual way. We therefore conclude: 

Theorem 4.5 For all J sufficiently large, the following holds: Let fi be any Gibbs 
measure for the two-dimensional Ising model with nearest-neighbor coupling J and zero 
magnetic field. Let T be the majority-rule transformation on 7 x7 square blocks. Then 
the mea.sure fiT is not consistent with any quasilocal specification. In particular, it is 
not the Gibbs mea.sure for any uniformly convergent interaction. 

It is of course unnatural and unpleasant for this result to be restricted to the special 
case of 7 X 7 blocks. This restriction was necessary only in Step 1 (the proof of a phase 
transition for some fixed block-spin configuration); it arose from the necessity to obtain 
a finite number of periodic ground states in order to apply P-S theory. All the other 
steps in the proof remain valid for blocks of arbitrary size and for Ising models in 
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Figure 8: Majority rule for 7x7 blocks, (a) The doubly-alternating block-spin config- 
uration, (b) The unique minimal-energy arrangement of islands of — spins in a -|- sea. 
The energy is 80J per 8 blocks, or fOJ per block. 
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arbitrary dimension. Digging a little deeper we see that the "rigidity" in the shape 
and position of the islands of minority spins, and hence the boundedness of the number 
of periodic ground states, is a consequence of the following numerological "miracle": 
the block size 6 = 7 and the island size c = 5 satisfy the Diophantine equation 



l + b' = 2c\ (4.34) 

The proof extends automatically to any block size b for which c defined by (4.34) is an 
integer. In Appendix C we find the general solution to this Diophantine equation: the 
admissible block sizes turn out to be 

bk = I [(1 + ^)''+' + (1 - V2f''+'] (4.35) 

for k = 1, 2, 3, ... . The first few bk are 7, 41, 239, 1393, 8119, .... For other block sizes, 
a proof of non-Gibbsianness using our methods would require either a more clever choice 
of block-spin configuration cu^ j^j, or else a more sophisticated version of P-S theory 
capable of dealing with infinitely many periodic ground states [46, 179, 22, 23, 48]. 
Irrespective of these technical details, it seems plausible to expect that the conclusion 
of Theorem 4.5 remains valid for all block sizes b. 

Remark. Griffiths and Pearce [172, 173] and later Hasenfratz and Hasenfratz [189, 
Section 4] have presented a rather different class of cases in which the majority-rule 
transformation is expected to have "peculiarities": in these examples the block-spin 
configuration cu^ j^j is taken to be purely -|-, and the magnetic field is taken to be 
negative (with an order- 1 strength chosen to exactly compensate the effect of the block 
spins). Our scheme of proof does not apply in these examples, for two reasons: Firstly, 
there are infinitely many periodic ground states, so P-S theory in its usual form does 
not apply. Secondly (and perhaps more seriously), in the configuration cu^ j^j all of the 
block spins are already -|-, and by construction the corresponding internal-spin system 
does not have a unique Gibbs measure; so it is clearly impossible to "select" the -|- 
phase (i.e. make it unique) by setting the block spins in an annulus to be -|-. This 
latter fact was already noted by Israel [207, p. 597]. 

4.3.5 Block-Averaging Transformations 

In contrast to our previous example, in this case our proof works for even block sizes 
(and only these) precisely because of the possibility of ties. We discuss here the simplest 
case, namely the 2x2 block-averaging transformation for the two-dimensional nearest- 
neighbor Ising model at low temperatures. We divide Z into 2x2 blocks Bj, and 
define 

<t', = E ^- (4-36) 

zeB, 

We notice that although the original variables a take two values (il), the renormalized 
spins cr' take five values (0, ±2, ±4). Usually the average spins are rescaled, but such a 
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Figure 9: The four periodic ground states of the internal-spin system obtained by 
constraining the block spins to be zero. 



rescaling is irrelevant for our discussion because we only consider a single application 
of the transformation and do not iterate. 

Step 1. We choose the configuration cu^ j^j defined by cr' = for all j (^ ~L . 
The resulting system of internal spins has, at low temperatures, four periodic Gibbs 
measures corresponding to four ground states formed by infinite alternating strips 
of thickness 2 (see Fig. 9). This follows immediately from Pirogov-Sinai theory (see 
Appendix B.5.7). 

Step 2. Let A be a ^N x ^N square. Take block-spin boundary conditions as 
follows: -|-4 for the rows of block spins immediately above and below A, -\-2 for the 
columns immediately to the right and left of A, and -|-4 for the columns just to the right 
and left of these [see Fig. 10(a)]. A slight modification of the usual Peierls argument 
proves that these boundary conditions induce at low temperature the Gibbs measure 
associated to ground-state ^3 in Fig. 9 [see Fig. 10(b)]. 

Step 3. We unfix two nearest-neighbor block spins: the one at the origin and the 
one immediately above it. Then, at sufficiently low temperature, one has with high 
probability the boundary condition of Fig. 11 for the two-block system (0',0')-(0', 1') 
(for a suitable positioning of the volume A). Notice that this boundary condition has 
eight -|- spins and only four — spins; therefore, it is clear that the spins inside the two 
blocks are biased towards -|-, so that 



p'a, 



;9A4 



[dt 



special ''-^^4;4,2v (0' fi') > r" yv,u;^p^^.^j ;ayU;4,2 



p'a, 



;9A4 



a 



(<d',i')j 



> c+{J) > 



(4.37) 



at zero magnetic field. [Indeed, at low temperature there is a probability sa | of having 
a strip configuration with crL, q,n = crL, ^,^ = and a probability sa | of having an all--|- 



configuration with crL, Q,^ = cr, 



(o',i') 



-|-4, so that limj^ooC+(J) = -\-2.] Similarly, by 



reversing the sign of the block spins on the boundary, we obtain 



/^A, 



;9A4 



special ' 

where of course C-[J) 



cr 



/^A, 



(Jt 



< c_{J) < 



(O'fi') — rk,i^' . ,;9A4^4 2 1*^(0', 1')^ 

^ ' ' ' special' ^1^1^ V ' / 

c_|_(J) by symmetry. This completes the argument. 



(4.38) 



Remark. Notice that it does not suffice to unfix a single block spin to distinguish 
among the four Gibbs measures, because in all four measures the boundary condition 
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Figure 10: Block-spin boundary conditions chosen lor Step 2. 
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Figure 11: Boundary conditions lor the block-spin observable ol Step 3. 
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on the unfixed block would be symmetric between + and — (i.e. four + spins and four 
— spins), and the expectation of the block spin would be zero. 

The argument given here clearly works for any even block size 6 > 2, in any lattice 
dimension d > 2. In this way we conclude: 

Theorem 4.6 Let d > 2, and let h > 2 he even. Then for all J sufficiently large 
(depending on d and h), the following holds: Let fi be any Gibbs measure for the d- 
dimensional Lsing model with nearest-neighbor coupling J and zero magnetic field. Let 
T be the block-averaging transformation with block size b. Then the measure fiT is not 
consistent with any quasilocal specification. In particular, it is not the Gibbs mea.sure 
for any uniformly convergent interaction. 

4.3.6 Generalization to Nonzero Magnetic Field 

It might appear from our results thus far that the RG pathology is somehow associated 
with the fact that the original Hamiltonian H lies on the phase-coexistence curve 
(which in the lsing model means zero magnetic field). This is in fact not the case. In 
this section we include a magnetic field /i, and show that in dimension J > 3 there is an 
open region in the [J^h)-plane — namely, low enough temperature and small enough 
field — where the decimation transformation produces a non-Gibbsian measure after 
one iteration. (We suspect that the result is true also for J = 2, but it will require a 
different proof.) Our argument works for decimation with arbitrary scale factor 6, and 
for the Kadanoff transformation with any p < oo. Moreover, for block- averaging we 
have an even stronger result: the renormalized measures are non-Gibbsian for arbitrary 
strength of the field in dimensions J > 2, at low temperatures. We conclude that the 
Griffiths-Pearce-Israel pathologies are not associated with the fact that the original 
model is sitting on a phase-transition surface. Rather, it is the system of internal spins 
constrained by the configuration cu^ j^j which must have a phase transition. If this 
transition is of a similar type as that of the original system, then it is natural to expect 
that the original system must at least be close to a phase transition, in some sense. 
But even this need not be the case, as the example of block-averaging transformations 
will show, if these two transitions are of different nature. ^"^ 

Let us first treat the case of decimation transformations. Consider, therefore, an 
interaction 

H = -JJ2a,a,-hJ2<^^■ (4.39) 

(Note that in our normalization, the magnetic field is not explicitly multiplied by any 
factor of J or /3.) The idea is that for suitable (small) values of h = h/J^ we can find 
an image-spin configuration cu^ j^j for which the corresponding internal-spin system 
has a non-unique Gibbs measure. Roughly speaking, cu^ j^j must be such that it 
"compensates" the effect of the magnetic field, so that the system of internal spins 



^For this reason, the title of our earlier report [353] is in retrospect somewhat misleading. 
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subjected both to the homogeneous field h and to the inhomogeneous field due to the 
image spins has two or more extremal Gibbs measures. 

As in all the previous cases, we formalize this idea in two steps: we first show 
that it works at zero temperature, namely that there are choices of h and cu^ j^j for 
which the ground state is not unique; and second we show that Pirogov-Sinai theory 
is applicable so that it implies nonuniqueness of Gibbs measures at low but nonzero 
temperatures. The simplest case is to let cu' ^^j^j be periodic}'^ In this situation one 
can find the "compensating" field ho needed to obtain more than one ground state by 
studying configurations inside a period. We do not want to enter into the details, as we 
later offer a better and more general procedure, but we make the rather obvious remark 
that the field must be taken in a direction opposite to that of the majority of internal 
spins within a period. The value of the field must be such that if the internal spins 
follow it, the energy gain is exactly compensated by the penalty paid by the internal 
spins neighboring the image spins of opposite sign. (In other words, the sum of all 
the fields — external or due to image spins — felt by the internal spins in a period 
must be zero.) This field strength ho is too weak to favor the hipping of small regions 
of internal spins, and only a collective fiip is energetically acceptable. Of course, this 
delicate balance is broken if the field is changed, no matter how little. Therefore, we 
conclude that this value ho = ^o(^speciai) i^ such that for h > ho (resp. h < ho) the 
internal-spin system has only one periodic ground state, namely all spins + (resp. all 
spins — ), while for h = ho there are precisely two periodic ground states, namely all 
spins + and all spins — . 

For the second step — relying on Pirogov-Sinai theory — we already have two of 
the required conditions (see Appendix B): a periodic (internal-spin) interaction and a 
finite number of ground states. We need in addition to verify the Peierls condition, but 
this can be done basically following the same energy-cost arguments outlined above for 
the determination of ground states. The conclusion is that there exists Jq = «/o (^special) 
such that for J > Jq there is a continuous curve h = h*{J), with limj^+oo h*[J) = ho, 
on which the internal-spin system has precisely two periodic extremal Gibbs measures, 
namely a "-|-" phase and a "— " phase. As long as io'^^^i is not all -|- or all — , Step 
2 can be proven using the FKG inequality, as in Section 4.3.1. We therefore conclude 
that for h = h*[J) the renormalized measure fiT is non-Gibbsian. 

The chief limitation of this procedure is that it produces only rational values of 
ho and that there is no uniformity in ho for the range of temperatures for which the 
nonuniqueness persists. Hence, by letting io'^-^^i range over all periodic configurations, 
we prove non-Gibbsianness only for a region of the phase diagram formed by countably 
many curves h = h*[J). We can prove that the set of ho values is dense in some interval 
|/i| < e but, unfortunately, we cannot conclude non-Gibbsianness for any dense subset 
of an open set in the ( J, /i)-plane: the trouble is that the curves h*[J) corresponding 
to configurations cu^ j^j of very high period may survive only to very low temperatures 
(i.e. we have no uniform control on Jq). 



*This construction was already suggested by Israel [207]. 

123 



If we want to extend this argument to more general choices of cu^ j^j, we are 
confronted with the hmitation imposed by the present versions of Pirogov-Sinai the- 
ory. One possible generalization of this construction is to let io'^-^^i be quasiperiodic. 
Then one can use an extension of Pirogov-Sinai theory due to Koukiou, Petritis and 
Zahradnik [221]. (Actually, these authors require the quasiperiodic part of the interac- 
tion to be small; so we cannot handle decimation, but can handle the Kadanoff trans- 
formation with p small.) In this way we obtain uncountably many curves h = h*[J) on 
which the renormalized measure is non-Gibbsian. (If the results of Koukiou et al. can 
be extended to frequencies which are Diophantine of arbitrary type / < oo — at present 
they treat only 1 = 2 — then the corresponding set of ho values would contain some 
interval |/i| < e except for a subset of Lebesgue measure zero.) However, we still cannot 
conclude non-Gibbsianness for any dense subset of an open set in the (J, h)-plane: the 
trouble is again that we lack uniform control on Jq. 

At any rate, we are able to overcome these technicalities, and we present here an 
argument proving the existence of Griffiths-Pearce-Israel pathologies for an open region 
{J > Jo, 1^1 < SqJ} in the (J, /i)-plane, as originally conjectured by Griffiths and Pearce 
[172, 173] and Israel [207]. The key ingredient is a mechanism to generate a continuum 
of image-spin configurations cu^ j^j such that P-S theory is applicable to the resulting 
internal-spin system. At present this is only possible if we resort to randomness: 
Zahradnik [369, 370, 371], and with less generality Bricmont and Kupiainen [43, 44], 
extended P-S theory to systems with superimposed (small) random interactions for 
dimensions d > 3. Our construction will, therefore, be based on a (slightly) random 
choice of the configuration cu^ j^j and will be limited to d > 3. 

Consider, for starters, decimation with some spacing 6 > 2, applied to an Ising 
model with ferromagnetic nearest-neighbor interaction J and magnetic field h = Jh > 
0. We consider a block-spin configuration which is equal to the fully alternating con- 
figuration except that the spins that would correspond to a "-|-" have a probability 
e/2J of becoming a "— ", independently for each such spin. We wish to show that 
for each sufficiently small positive /i, there exists an e such that the random magnetic 
field induced by the block spins (whose net effect is negative) exactly compensates the 
positive uniform field, in the sense that for almost all such image-spin configurations 
there are two distinct Gibbs measures /i_|_ and //_. To do this, we apply an as-yet- 
unpublished theorem of Zahradnik [370, 371], which generalizes Pirogov-Sinai theory 
to small random interactions, if the lattice dimension is > 3. (In the preprint [370], 
the random interactions are assumed to be small uniformly in all realizations of the 
randomness. This condition is not satisfied in our case, as one has large terms (of 
strength ~ J), albeit occurring with small probability (e/2J). In a private communi- 
cation [371], Zahradnik has informed us that minor modifications of his proofs suffice 
to cover also this case.) 

We apply Zahradnik's theory with the original Hamiltonian Hq taken to be the 
system of internal spins with fully alternating image spins (i.e. a ferromagnetic nearest- 
neighbor Ising model in a periodically diluted lattice and with a periodic magnetic field 
of mean zero, see Section 4.3.2); the symmetry-breaking "fields" are taken to be the 
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uniform magnetic field, and the random negative magnetic fields coming from those 
block spins that were flipped from "+" to "— " according to the procedure explained 
above. The analysis of the ground-state structure of i^o, and the proof of the Peierls 
condition for it, were already carried out in Sections 4.3.2 and B.5.3. Zahradnik's theory 
(Theorem B.3f) then assures us (Section B.5.7) that for each J sufficiently large and 
each e sufficiently small, the phase diagram is, with probability f , a small deformation 
of that of the Hamiltonian Hq] that is, for each such pair (J, e) there exists a unique 
h*{J,e) > such that the system has two distinct Gibbs measures /i_|_ and //_ (which 
can be obtained, for example, by taking h I h* oi h '\ h* ^ respectively). Moreover, the 
value /io(e) at which the "+" and "— " configurations are simultaneously ground states 
is a strictly increasing linear function of e. (This follows from an argument similar 
to, albeit more elaborate than, the one presented at the beginning of the section for 
periodic choices of cu' ^^j^j. The slope depends on the block-size b and the dimensionality 
d.) As Zahradnik's theory tells us that the low-temperature phase diagram is a smooth 
deformation of the zero-temperature one, we conclude that that h* is a continuous and 
strictly increasing function of e. Obviously the case h < can be handled by the same 
argument with "-|-" and "— " reversed. 

The bottom line is, therefore, that there exists — for each J sufficiently large — a 
continuous and monotonic curve €*[h) through the origin, defined for \h\ small, such 
that for almost all choices of the random block-spin configuration the system presents 
multiple Gibbs measures on the curve and a unique Gibbs measure to each side of the 
curve (Figure f 2). Thus, for the Ising model with J sufficiently large and \h\ sufficiently 
small, we can prove Step f by chosing as cu^ j^j any one of the configurations from the 
probability-f set corresponding to e = €*[h). The proof of the validity of Steps 2 and 
3 is essentially identical to that of the case h = (Sections 4.3. f and 4.3.2). We notice 
that due to the smoothness of the phase diagram deformations, the bound \h\ < 6{J) 
for which these steps, and hence the existence of RG pathologies, can be proven is 
given by a continuous function 6{J). Moreover, we have liminfj^oo ^{J) > <^o > 0. 

The final result is the following: 

Theorem 4.7 For each d > 3 and b > 2, there exists a Jq < oo and a Sq > (depend- 
ing on d and b) such that for all J > Jq and \h\ < SqJ the following is true: Let fi be 
any Gibbs measure for the d-dimensional Ising model with nearest-neighbor coupling J 
and magnetic field h. Then the renormalized measure fiT arising from, the decimation 
transformation with spacing b is not consistent with any quasilocal specification. In 
particular, it is not the Gibbs measure for any uniformly convergent interaction. 

Similar results are valid, by a similar argument, for the Kadanoff transformation 
with any fixed < p < oo. However, we are not able to apply such an argument to 
the majority-rule example because we need dimension J > 3. This seems to be only a 
technical reason. 

The proof for block-averaging transformations in a field is much simpler: we do not 
need randomness in the choice of io'^^^i. In fact, the same steps detailed in Section 
4.3.5 above can be applied regardless of whether or not a field is present. Steps I and 2 
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Figure 12: Phase diagram for a random-field Ising model at low temperatures {d > 3). 
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are exactly the same for all values of the field, because the constraint of zero block spins 
removes all field-dependence inside such blocks. (The point is that the ground-state 
configurations, as well as the block-spin boundary conditions needed to select them, 
are the same for all values of the magnetic field.) In Step 3, the presence of a field 
causes an asymmetry between "-|-" and "— " boundary conditions — i.e. we no longer 
have C-[J) = — c_|_(J) — as well as a smaller value for the difference c_|_(J) — C-[J) 
between the two magnetizations. But this difference is still bounded away from zero 
uniformly in A (the extra factor involved depends only on the fields at the eight sites of 
the two-block observable), so the result is still valid. Alternatively, one could "unfix" 
a strip of A^ X 2 blocks. 
Therefore, we have: 
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Theorem 4.8 For each d > 2 and each even b > 2, there exists a Jq (depending on d 
and b) such that the following is true: For any Gibbs measure fi of the d-dimensional 
Ising model with nearest-neighbor coupling J > Jq and arbitrary magnetic field h, 
the renormalized measure fiT arising from, the block-averaging transformation is not 
consistent with any quasilocal specification. In particular, it is not the Gibbs measure 
for any uniformly convergent interaction. 

This result is in contrast with the results obtained above for decimation and Kadanoff 
transformations, where we were able to prove non-Gibbsianness for h ^ only for d > 3 
and only for \h\/J small. The restriction to weak fields is, for these examples, essen- 
tial, because it is known that in a strong field the renormalized measure is Gibbsian 
[172, 173, 207]. Moreover, Martinelli and Olivieri [258] have proven that for any (J, h) 
with /i 7^ 0, the decimation transformation results in a Gibbsian measure when the 
spacing b is large enough (how large depends, of course, on J and h). 

Finally, we note an interesting consequence of our Theorem 4.7: for the Ising model 
in dimension J > 3, in the region {J > Jq, \h\ < SqJ}^ the Dobrushin-Shlosman [94, 96] 
complete analyticity condition is violated. 

4.4 Large-Cell Renormalization Maps in Dimension d ^ 4 

B&at Five years ago, Lebowitz and Maes [239] constructed a very different example of 
a non-Gibbsian measure, arising in the study of entropic repulsion of a surface by a 
wall. Subsequently, Dorlas and van Fnter [103] generalized this example, and pointed 
out its relevance for the renormalization-group theory of Ising-like models in dimension 
d (^) 4. In this section we present a slightly generalized version of the Lebowitz-Maes- 
Dorlas-van Fnter theorem on non-Gibbsianness, and then discuss its relevance for RG 
theory. The reader interested primarily in the results (resp. in the application to RG 
theory) should read up through the statement of Theorem 4.9, and then skip directly 
to Section 4.4.3 (resp. to Section 4.4.4). 
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4.4.1 Non-Gibbsianness of the Sign Field of an (An)harnionic Crystal 

Consider a system of real-valued spins {(fix}xei.''7 ^^^ define a^ = sgn[(fix)-^^ Clearly 
{o'x}xei'' i^ ^ field of Ising spins. We shall show that for certain massless Gibbs measures 
on the system of {ip} spins, the projection of such a measure on the {a} spins is non- 
Gibbsian. 

The measures we have in mind are those possessing a spontaneously broken global 
shift symmetry (^ — )• (^ + c. More precisely, consider a system defined formally by the 
Hamiltonian 

H{v) = \Ey^yiv^-Vy)^ (4-40) 

x:^y 

where the functions Vxy are even, and Vxy = Vx+a,y+a for all x, j/, a G Z . Such a system 
is termed an anharmonic crystal (or if the functions Vxy are all quadratic, a harmonic 
crystal). [More rigorously, such a system is defined by the interaction 

$^((^) = f'^-yiv^-^y) iiA={x,y} ^^_^^^ 

1 otherwise 

where the a priori measure dfi^[(fix) is taken to be Lebesgue measure. Lebesgue mea- 
sure is not normalizable, but if the potentials Vxy are chosen suitably, then one has 
-^a(')^a=) < oo, and the specification is then well-defined. In the infinite-range case there 
are some subtleties associated with rapidly growing boundary conditions, as discussed 
in Example 4 of Section 2.3.3.] 

For an (an)harmonic crystal, an infinite- volume Gibbs measure need not exist; 
and indeed, it will not exist in low enough dimension, e.g. d < 2 for short-range 
interactions [39, 92, 133]. However, if a Gibbs measure fi does exist, then it possesses 
a spontaneously broken global shift symmetry in the sense that Tcfi is also a Gibbs 
measure for the same interaction (here Tc is the map that shifts all spins by a constant 
c), but Tcfi 7^ fi for c ^ 0. That Tcfi is a Gibbs measure is an immediate consequence 
of the DLR equations, while Tcfi ^ fi follows from the impossibility of the probability 
distribution of (fo being invariant under a non-trivial shift. Further information on the 
properties of (an)harmonic crystals can be found in references [39, 42]. 

Fvery harmonic crystal is a massless Gaussian model, and the converse is very 
nearly true. To see this, consider a translation-invariant Gaussian measure // on R^ 
with mean m and covariance 

{ipx;ipy) = Cxy = (27r)-'^ / d{p)e'-'^<^-yUp, (4.42) 



^^Strictly speaking we must define a^ also in the ambiguous case ip^ = 0. The simplest choice is to 
set ax = -|-1 by fiat; the most elegant choice is to set ax = ±1 with probabilities ■^. However, this 
choice will in fact play no role, as every measure that we will consider has the property Prob(i^j; = 
for at least one x) = 0. 
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where c{p) is a nonnegative, even, integrable function of p G [— tt, tt]''. Now let us define 

B,y = (27r)-'^ / dip)-' e'P<^-yUp , (4.43) 



assuming that c[p) ' is an integrable function of p. Then B is the inverse matrix of 
the covariance matrix C . Now, suppose that 

J2\B^y\ < TO , (4.44) 

y 

as will occur if c[p)~' is at least modestly smooth. ^^ In that case, there is a well-defined 
specification corresponding to the formal Hamiltonian 

Hi'f) = IJL^^y'f^'fy - hJ2f^ (4-45) 

with h = m/c[0) and a priori measure taken to be Lebesgue measure; and // is a Gibbs 
measure for this specification. In particular, if c(0) = oo — this is the "massless" case 
— then the couplings B^y satisfy 

J2Bxy = 0. (4.46) 

y 

This means that the Hamiltonian can be rewritten as 

H{^) = lEB^yi^^-^yf (4.47) 

x:^y 

(note that here h = 0). Thus, every massless Gaussian measure (satisfying mild reg- 
ularity conditions) is the Gibbs measure for some harmonic crystal, and conversely. 
Further details on the Gibbs representation of Gaussian measures can be found in 
references [86, 227] and [157, Chapter 13]. 

Now let fi be any translation-invariant Gibbs measure of the (an)harmonic crystal, 
and let Jl be its Ising projection. Under mild technical conditions on the potentials 
Vxy^ we will prove that /I is a non-Gibbsian measure. The basic idea of the proof is to 
use the spontaneously broken shift symmetry to show that 

Prob^((^^ > for all x e A) > e'^d^D (4.48) 

as A y oo (van Hove). That is, the probability that all the spins in a region A are 
simultaneously positive is exponentially suppressed at a rate slower than the volume 
of A (roughly speaking, it is suppressed by a "surface term"). This means that 

t{6+\p.) = , (4.49) 



®®A necessary condition for ^ \Bxy\ < oo to hold is that c(p) ^ be a continuous function of 
p G [— TT, tt]''. However, this condition is not sufficient; for some sufficient conditions in the case d = 1, 
see [108, Section 10.6] and [211]. 
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where 6'^ is the delta measure concentrated on the configuration with all spins +. If /I 
were Gibbsian, then by Proposition 2.67, 6~^ would have to be Gibbsian for the same 
interaction. But 6'^ is obviously non-Gibbsian (no absolutely summable interaction 
can force a spin to be +), so /I must also be non-Gibbsian.^^ 

The proof of (4.48) proceeds in three steps: 

Step 1. Using the DLR equations, one proves the identity 

F{Lp)diJ.{Lp) = [ F{^ + kxA)e-^^^'^^+''''^'^Uii{^) (4.50) 



for any bounded function F and any Gibbs measure //. Here A is an arbitrary finite set 
of sites, xa is its indicator function, k is an arbitrary real number, and Hrei denotes the 
energy difference between the two configurations. (Since the two configurations differ 
on a finite set of sites, this energy difference is finite /i-a.e.) In essence, this identity 
says that a configuration Lp + kxA has a probability e~^''^-'^^~^^^^'^> times as large as 
that of the configuration (p. 

Step 2. One estimates the energy difference Hj-eh ^"^^ attempts to remove the factor 
g-Hrei fj-oni the right-hand side of (4.50) at the price of a prefactor e~°^'^'> . 

Step 3. Specializing to the case F{(p) = x{^ > on A), one attempts to prove a 
lower bound on JF{Lp -\- kxA) d/^i^) that is of the form e~'^'^^^\ where f{k) — )• as 
k — )• -|-oo. Since the left-hand side of (4.50) is independent of A;, we can take k — )• -|-oo 
and thus complete the proof. 

Unfortunately, Steps 2 and 3 are slightly tricky (though not terribly complicated), 
and the details of the proof depend on the exact form of the potentials V^y. In fact, 
we have three distinct proofs, each one valid for a distinct class of Vxy'. 

(a) Each V^y is convex, and Y, ||K)'^l|oo < oo- 

(b) Each Vxy is quadratic (of either sign), and the measure // is a massless Gaussian 
satisfying (4.44) and (4.46). 

(c) Each Vxy is convex, the model is finite-range (i.e. only finitely many of the Vqz 
are nonzero), and the model is dominated by a stable Gaussian in the sense that 
the vectors {z: inf V^''((^) > 0} span a subspace of R of dimension > 2. 

(We conjecture that these technical conditions can be removed or at least weakened.) 
Case (a) is the easiest case, as the energy shift is uniformly bounded; unfortunately, 
the sup norm condition on V does not allow potentials growing faster than linearly at 
infinity (such as Gaussians!). All the technical details in cases (b) and (c) are attempts 



^"^Alternative argument: liji were Gibbsian, then by Proposition 2.67, (5+ would have to be Gibbsian 
for the same interaction, and moreover i(jl\6'^) would have to be zero. But in fact i(jl\6'^) = cxd. Second 
alternative argument: If Ji were Gibbsian, then by Proposition 2.59, the pressure p(g\Ji) would have to 
be strictly convex in directions g = f^ ^ 2 + const arising from interactions $ G Z?^. But for g(o') = ctq 
(i.e. a magnetic field), it is easy to see from (4.48) that p(Xg\Ji) = A for all A > 0, contradicting the 
strict convexity. (For a more general version of this latter argument, see Section 4.4.2.) 
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to control an energy shift that is bounded only in some average sense. We urge the 
reader to study first the proof for case (a), before proceeding to cases (b) and (c). 
Case (b) is the one treated by Dorlas and van Enter [103]; we follow their proof almost 
verbatim. Case (c) is a minor generalization of the one treated by Lebowitz and Maes 
[239]; the proof we give is slightly different from theirs, but the underlying ideas and 
tricks are the same. 

Theorem 4.9 Let fi be a translation-invariant Gibbs measure for an (an)harmonic 
crystal satisfying one of the conditions (a)-(c) listed above. In case (c), assume in 
addition that fi is symmetric around its mean. Then for each M < oo, we have 

Prob^((^^ > M for all x ^ A) > e'^d^D (4.51) 

as A /^ oo (van Hove). It follows that Jl, the projection of fi on the Ising spins 
CTx = sgn[(fix), is not the Gibbs mea.sure for any interaction in B^ . 

Remarks. 1. One consequence of this theorem is that an arbitrarily weak pertur- 
bation of the form H ^ H — Y,x fi^x)} where / is nondecreasing and nonconstant, 
will drive the spins (p^ to +00. As a result, thus the perturbed model will have no 
infinite- volume translation-invariant Gibbs measures. This is the phenomenon of en- 
tropic repulsion of a surface by a soft wall, studied by Lebowitz and Maes [239]. 

2. It is natural to ask whether fl is non-quasilocal (and not merely non-Gibbsian). 
We discuss this question, in somewhat greater generality, in Section 4.4.3. 

Proof of Theorem 4.9. Since the hypotheses of the theorem are invariant under 
a uniform shift (^ — )• (^ -|- c, it suffices to consider the case Af = 0; this lightens the 
notation. (For our application to the sign function, we need only Af = anyway. But 
we will exploit the formulation with general M in Section 4.4.2.) 

Step 1. Let fl be any Gibbs measure for any model of real- valued spins (not neces- 
sarily an anharmonic crystal). Then the DLR equations for volume A say that 

dM^Al^Ac) = ZA((^Ac)-ie-^*('^*''^*^)j(^A. (4.52) 

Now let F be any bounded (for simplicity) measurable function, and let t/j be any field 
which vanishes outside A. Then 

F{p)dMpA\vAc) = ZA((^Ac)-iy'F((^)e-^*('^*''^*^)j(^A 

= ZA((^Ae)-' / F{p + V) e-^A('^A+'^A''^A=) J(^A 

F{p + t/j) e-[^A('^A+^A,'^Ac)-ffA('^A,'^Ac)] J^^ ((^^ | (^^, ) ^ 

(4.53) 

where in the middle line we used the shift invariance of Lebesgue measure. Now 
integrate over dfiAc{(fiA^)- we obtain 

F{^)dfi{^) = /"F((^ + V')e-^-'('^+'^''^)j/i((^) , (4.54) 
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where 

Hreli^ + 4^;^) = HA{(pA + tpA,(pAc) - Ha{(PA,(PAc) . (4.55) 

Note that Hrei is independent of A as soon as A Z) suppt/j. The identity (4.54) is thus 
vahd for any t/j of bounded support. In particular, if we take t/j = kxA^ we obtain 
(4.50). In the case of the anharmonic crystal (4.40) we have the following expression 
for Hrel- 

Hreii^ + kxA] ^) = Yl [^^y{^^ + k - ^y) - V^y{(p^ - (py)] (4.56a) 

X e A 
ye A-' 



E I V;',(^.-^, + V')#. (4.56b) 



X ^ A 

y eA^ 



Step 2. The goal of this step is to prove that 

F{p)dii{p) > e-°^^^^\^ ( F{p + kxA)dii{p) (4.57) 



(or some similar formula) for some suitable class of nonnegative functions F . Here 
o/;(|A|) denotes a term that may depend in an arbitrary way on A;, but for each real k 
it should be o(|A|) as A /^ oo. 

Case (a): This is the easy case, as the energy shift (4.56) can be bounded in sup 
norm: 

I I 7T II ^ \ ^ I 7 I II T^/ I I 

||-nre/||oo ^ / J \'^\\\'^xy\\oo 

X ^ A 
y eA^ 

< |A;|o(|A|) asA/oo (4.58) 

by the usual argument based on Y^ \\^oz\\oo < oo (see e.g. the proof of Proposition 2.45 
in Appendix A. 3. 8). Substituting (4.58) into the identity (4.50), we conclude that 

F{ip)dfi{ip) > e-\'\°^\^\^ f F{ip + kxA)dfi{ip) (4.59) 



uniformly for all nonnegative bounded functions F. 

Case (h): Here we apply the Schwarz inequality to the right-hand side of the identity 
(4.50): 

„ . , ^ \.l' F{Lp + kxAY'^dljiLp)Y 

F{p + kxA)e-''-^^-+'^--'-Ufi{p) > ^\ Z , It ,,r< (4-60) 

' ' ' J e+^rei(v+''>^A;v) dji{(fi) ' 

for any F > 0. In particular, if F is the indicator function of some set, then F^'^ = F. 
Now in case (b) we have 

Hreii'-p + kxA^'-p) = k{Lp,BxA) + —{xa.Bxa) , (4.61) 
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and /i is a Gaussian measure with mean m and covariance matrix C = B ^ . We can 
therefore calculate exactly 

y'g+//.eK'.+fcx.;^)j^(^) = eMk\xA,BxA) + km{l,BxA)] 

= e^^[k\xA.BxA)] (4.62) 

since 51 = by (4.46). Now 

k^XA.BxA) = 2^ J2 B,y 

X ^ A 
y eA^ 

< k'^o{\A\) asA/oo (4.63) 

by the usual argument based on Xl l-^o^l < oo. Hence 

^^0 



F{^)dn{^) > e-'^'^d^l) 



2 

F{Lp + kxA)diJ.{Lp) 



(4.64) 



uniformly for all indicator functions F. This is a slight variant of (4.57). 

Case (c): This case is a little bit trickier. Let F be any nonnegative function 
supported on the set {(^: a < (^ < h on A and a' < (^ < 6' on df A], where r is 
the range of the interaction. Then on the right-hand side of (4.50) the integrand is 
nonvanishing only when a — k < (^^< h — k for x G A, and a' < (py < b' for y G d^A. 
Now, since V^y is convex, V' is increasing, so Vxy{(fix ~ Vy ~'t~ k) — Vxy{(fix ~ Vy) is an 
increasing (resp. decreasing) function of (p^ — ^y for k > (resp. A; < 0), as seen from 
(4.56b). Therefore, for k > (which is the case that will interest us) we have 

Hreiip + kxA]^) < Yl [Vxyib - a') - V^y{b - a' - k)] 

X G d~ A 

y e 9;J"A 

< Cia',b,k)\d;A\, (4.65) 

where C{a\ 6, k) = Y,z[^Oz{b — a') — Voz{b — a' — k)] is finite for all a', 6, k (since only 
finitely many terms in this sum are nonzero). Hence 

F{if)dfi{if) > e-^(»''^''=) 19-^1 [F{if + kxA)dfi{if) (4.66) 



uniformly for all nonnegative F satisfying the support condition. This, too, is a variant 
of (4.57). 

Step 3, Case (a): We apply (4.59) to F((^) = xi't' > on A), so that F{Lp + kxA) = 
x{(fi > —k on A). Since the V^y are convex, it follows immediately from the DLR 
equation that the FKG inequality [24] holds for every Gibbs measure //. Therefore we 
have 

{Xiip > -k on A)), > n {Xi^^ > -k)), = Prob,((^o > -k)^^^ (4.67) 

xeA 
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Combining this with (4.59), we get 

hminf — — log Prob^((^ > on A) > logProb^((^o > —k) . (4.68) 

A/'oo \A\ 

But taking k — )• +oo, the right-hand side goes to zero. 

Step 3, Case (h): We apply (4.64) to F{Lp) = x{^ > on A). We control Prob^((^ > 
— k on A) using the Brascamp-Lieb inequality [37, 38], which is valid for arbitrary 
Gaussian measures, combined with the Chebyshev inequality: 

Prob^((^ > —k on A) > Prob^(|(^ — m\ < m -\- k on A) 

n 

= Yi Prob^f \lPx^ —ml < m -\- k \Lpxj — m\ < m -\- k for 1 < j < i 

8 = 1 

[where A = {xi, . . . , a;„}] 

E^[{lP^^ - mf \lp^^ - m| < m + A; for 1 < j < zj \ 



> n 1 



8=1 V irn + kf 

[by Chebyshev] 



" / var,, cc^. 
J^ 1 ^^^"' 



I (^j; — m\ < m -\- k for 1 < j < i 



8 = 1 



{m + kY I 

[since conditioning a Gaussian on a set 
symmetric about the mean preserves the mean] 



> nfi ""^-(^ 



8=1 \ (^ + ky J 

[by Brascamp-Lieb] 

Combining this with (4.64), we get 

liminf^logProb„((^ > on A) > 21og|l-- ^^| . (4.70) 

A/oo |A| ^ ^'^ J - t. y ^^j^j^yj V ^ 

Now take k — )• -|-oo. 

Step 3, Case (c): We apply (4.66) to F{(^) = x(a < (^ < 6 on A and a! < (^ < 
y on 9+ A), with the choices a = 0, b = 2m + 2k, a' = -k, b' = 2m + k, k > 0. We 
therefore need to control 

Prob^(a — k<Lp<b — k on A and a' < Lp < b' on d^A) 

= Prob^(|(^ -m\<m + k on AU 9+ A) . (4.71) 
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To do this, we employ the Brascamp-Lieb and Chebyshev inequahties as in case (b) [the 
Brascamp-Lieb inequahty is vahd because all the V^y are convex]. Here it is important 
that fi be even about its mean, because Brascamp-Lieb refers to variances rather than 
to expectations of squares; we need to know that conditioning // on a set symmetric 
around the mean does not displace the mean. The only other change from case (b) is 
that vaT^[(fix,\ ■ ■ ■ ) is bounded above not by var^((^2;J, but rather by the variance of 
(fix, in the dominating Gaussian, which by hypothesis is finite (call it Coo)- Thus, we 
have 

/ c \ ^^^^^^^ 

Prob„(|(^-m| < m + A; on Aua+A) > 1-- ^ . (4.72) 

y (m + k) / 

Combining this with (4.66), we get 
liminf — — log Prob^((^ > on A) 

A/'oo \A\ 

> lim inf — — log Prob^(0 < fi < 2m -\- 2k on A and — k < fi < 2m + A; on d^A) 

A/co \A\ 

Now take k — )• +00. ■ 

Remark. We do not know whether there can exist translation-invariant Gibbs 
measures for the anharmonic crystal that fail to be symmetric around their mean. (In 
the Gaussian case such measures cannot exist.) That is, we do not know whether 
the refiection symmetry can be spontaneously broken. If the answer is no, then our 
additional hypothesis in case (c) is superfiuous. 

The technical condition in case (c) — that the model be dominated by a stable 
Gaussian — unfortunately excludes some interesting models, such as the {^fiY model. 
The need for this technical condition arises from the use of Brascamp-Lieb inequalities 
to bound the conditional probability Probed (^2;^ — ?7i I < mi-\-k\ \fix^ —mi\ < mi-\-k iov 1 < 
i < i). An alternate approach would be to use the FKG inequalities as in case (a), but 
then we would be forced to work with increasing functions, i.e. to take b = b' = -|-oo. 
Unfortunately, the cutoff b < 00 was necessary in case (c) in order to control the energy 
shift Hreh which otherwise could be unbounded above. 

How can we escape from this dilemma? Let us first note that the large energy 
shift arises from applying the shift fi^ ^)- fix + k to fields fix that are already large and 
positive, hence have no need to be shifted farther upwards in order to bring them above 
the level fi = 0. This suggests that instead of applying a uniform shift fix ^)- fix + k in 
the region A, we should apply a nonlinear map fix — > f{fix) that would produce a large 
upward shift when fix is negative, but a smaller shift when fix is large and positive. In 
this way we may hope to have an energy shift that is uniformly bounded above. Of 
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course, in the case of a nonlinear map / we must also deal with a Jacobian, but this 
turns out to be manageable. The idea may be crazy, but it seems to work, at least for 
some rather large class of potentials V^y. However, this paper is already much too long, 
and we have not had time to work out all the details, so we leave further development 
of this circle of ideas to the interested reader. 

4.4.2 Non-Gibbsianness of Local Nonlinear Functions of an (An)harnionic 
Crystal 

The method of the preceding section applies, in fact, to local nonlinear functions much 
more general than the sign. Indeed, let Oq be a compact metric space, and let /: R — > 
Oq be any function (not necessarily continuous) such that lini,^^^co f{v) = ^* exists. 
We shall show that for the class of massless Gibbs measures on the system of {ip} spins 
considered in the preceding section, the projection of such a measure on the {io} spins 
is non-Gibbsian. 

Theorem 4.10 Let ji he any translation-invariant measure on R satisfying the es- 
timate (4-51) for all M < oo. Let Oq he a compact metric space, and let /: R — )• Oq he 
a function (not necessarily continuous) such that linit^^+oo f{^) = ^* exists. Let Jl he 
the image measure of fi under the map f applied to each spin. Then /I is not the Gihhs 
for any interaction in B^ , with respect to any a priori mea.sure .supported on more than 
one point. 



Proof. Let [/, V be open sets in Oq satisfying to* ^ U C U CV . Then let Qq: Vl'^ ^ 
[0, 1] be a continuous function satisfying go\U = 1 and go IV^ = 0; the existence of 
such a function is guaranteed by Urysohn's lemma. Now define g: Oq — )• [0, 1] by 
g{{'jJx}xei'^) = S'ol^o)- That is, g is the function go applied to the spin at the origin. 
Now let us compute the pressure p{\g\'jj.) for A > 0: 



p{\g\lji) = lim n log / exp 
= lim n~ log / exp 

ri^oo J 



xec'n 



dfi[ 



iO 



^ J2 goifi'fx)) 

xeC'n 



diJ.{Lp) 



(4.74) 



if this limit exists. Since go < 1, clearly the lim sup is < A. On the other hand, the 
lim inf is 

> liminf n-'^log [e^'^'Prob^(/((^^) G U for all x G C„)] 

> liminf n-'^log [e^'^'Prob^((^^ > M for all x G C„)] 

= A, (4.75) 
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where M is chosen so that (p > M imphes f{(fi) G U; here the final equahty uses the 
fundamental estimate (4.51). So we have 

p{Xg\ll) = X for all A > . (4.76) 

But this violates the strict convexity of the pressure which must hold if /I is a Gibbs 
measure for an interaction in B^ (Griffiths-Ruelle theorem, Proposition 2.59). Hence 
/I is non-Gibbsian. ■ 



4.4.3 Physical Interpretation 

We have proven that /I is not the Gibbs measure for any interaction in B^^ but is 
this enough? We know that non-Gibbsianness can sometimes occur for "trivial" rea- 
sons, e.g. if there are hard-core exclusions, or for "semi-trivial" reasons, e.g. if the 
Hamiltonian i7^ is quasilocal but unbounded. (This latter can happen only when the 
single-spin space is infinite.) If we contend that Jl is "pathological", then we really 
ought to prove not merely that /I is non-Gibbsian, but also that it is non- quasilocal . 

We are not able at present to prove non-quasilocality, but we can argue heuristically 
that in at least some cases the non-Gibbsianness does involve some strongly non-local 
effect. Consider the sign of the (an)harmonic crystal. Recalling Theorem 4.9 together 
with Remark 1 following it, it is natural to conjecture that 

lim EA sgn((^o) 't'x > ^ for all x having R <\x\< E' \ = 1 (4.77) 

for all i?, no matter how large. (At least in case (c) of Section 4.4.1, we are able to prove 
this using the FKG inequality, via a slight extension of the arguments of Lebowitz and 
Maes [239].) That is, if we condition on the spins in an annulus R <\x\< B! being all 
> 0, as i?' — )• oo this drives all the spins to -|-oo, and in particular forces the sign of the 
spin at the origin to be -|- (with probability l!). For the Ising measure /I, this means 
heuristically that the spin at the origin is feeling an infinite energy. However, since 
the effect occurs for all i?, no matter how large, this infinite energy must arise from 
the interaction between the spin at the origin and arbitrarily distant spins. (Crudely 
speaking, the interaction, if it exists, is non-summable.) Thus, we do not have here 
merely the "semi-trivial" situation of a Hamiltonian which is quasilocal but unbounded 
(which anyway is impossible for a model with finite single-spin space); some strongly 
non-local effect is taking place. It may even be that (4.77) implies non-quasilocality; 
or it may be that non-quasilocality can be proven by a different argument. These are 
open questions. 

A similar situation probably holds in the setup of Section 4.4.2, whenever the image 
single-spin space Oq is finite. 

A very different situation arises if / is a bijective map of R onto Oq (of course Oq 
must then be uncountably infinite!). In this case / is merely a one-to-one relabelling 
of spin values; the physics of the image measure /I is obviously identical to that of the 
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original measure //. In particular, if the original (an)liarmonic crystal has finite-range 
interactions, then /I is consistent with a Gibbsian specification for a particular finite- 
range but unbounded interaction, namely the one gotten by mapping the (an)harmonic- 
crystal specification via the function /. Such a specification is always quasilocal; the 
interaction is uniformly convergent but not absolutely summable. 

Finally, let us remark that the local nonlinear maps considered here are a spe- 
cial case of the renormalization transformations considered in Sections 3 and 4.1-4.3: 
namely, one in which the blocks are single sites, the transformation is deterministic, 
and the image space is in general different from the original space. Such transforma- 
tions trivially obey properties (T1)-(T3) of Section 3.1. Of course, if / is one-to-one, 
then the transformation is trivial (just a relabelling of spin configurations). However, 
if / is many-to-one, then the transformation is not so different in nature from the usual 
(block-spin) renormalization transformations: both "discard details" from the original 
spin configuration. These details may be in the fine structure of a single spin, or in 
the local fine structure of a small block of spins, but qualitatively there does not seem 
to be any great intrinsic difference. Our theorems both in Sections 4.1-4.3 and in the 
current subsection are of the general type: an RT map which discards (important) 
information makes the image measure (sometimes) non-Gibbsian (and possibly even 
non-quasilocal). 

4.4.4 Application to the Renormalization Group 

In this section we apply Theorem 4.9 to the RG, following closely Dorlas and van 
Enter [103]. Let us consider an Ising model in dimension J > 4 at the critical point, 
and apply block-averaging transformations on various block sizes b. Then DeConinck 
and Newman [71] and Shlosman [323, and private communication] have shown that 
there exists a 6-dependent choice of normalization such that the block-spin measures 
converge as 6 — )• oo to a massless Gaussian measure^®; this is a slight variant of the 
Aizenman-Frohlich triviality theorem. 

Now the key observation is that a block-averaging transformation followed by a 
projection on Ising configurations is identical to a majority-rule transformation. So 
consider applying the majority-rule transformation using larger and larger block sizes 
b. Since the block-averaged spins (with a suitable 6-dependent normalization) converge 
as 6 — )• oo to a massless Gaussian, it is not difficult to show that the majority-rule image 
spins converge as 6 — )• oo to the sign of this same massless Gaussian. But by Theorem 
4.9, this latter measure is non-Gibbsian! (For details, see [103].) 

This non-Gibbsian scaling limit is not a fixed point in the strict sense, as the 
sequence of majority-rule transformations lacks the semigroup property: the majority 



^^Conventional wisdom holds that the normalization can be chosen to be b~P for a suitable power 
p [in fact one predicts p = (d + 2 — ?y)/2 = (d + 2)/2]. If this is the case, then the limiting measure 
can also be obtained by repeated application of the block-averaging transformation with a fixed block 
size h, and hence is a self-similar Gaussian measure [327, 17, 32]. However, this conventional wisdom 
has not yet (as far as we know) been proven rigorously. 
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rule on block size P is not equal to the second iteration of majority rule on block size 
b (as politicians well know!). Therefore, the existence of pathologies for the fixed point 
arising from the 6 — )• oo limit does not guarantee that the corresponding pathologies 
will occur for the fixed point arising from iteration of a majority-rule map with a 
fixed block size b. But it does make it plausible: there does not seem to be so much 
difference between majority rule on a block of size 6" and n iterations of majority 
rule on a block of size b. And, in any case, the "large-cell majority-rule" approach 
is clearly part of the RG enterprise [f28, 249], so it is interesting to see that it can 
fail. Finally, as we discuss in Section 5.2, there are other reasons to expect that this 
behavior is in some sense typical. Indeed, we conjecture that the fixed-point measures 
of nonlinear RG transformations for d ^) du (= upper critical dimension of the model) 
will be non-Gibbsian in considerable generality. 

Finally, we remark that the results discussed here for J > 4 are expected to hold 
also for J = 4, provided that the "triviality conjecture" [365, 115] is true. 

4.5 Other Results on Non-Gibbsianness and Non-Quasilocality 

In Sections 4.1-4.4, we have given a number of examples of non-Gibbsian (or what is 
slightly stronger, non-quasilocal) measures, with particular attention to those arising in 
RG theory. It is natural to ask whether the phenomenon of non-Gibbsianness (or non- 
quasilocality) is more widespread. Unfortunately, very little is known at present about 
the properties of non-quasilocal measures, and very few examples of non-quasilocal 
measures are known. In this section we try to make a complete survey of all known 
physically interesting examples of non-quasilocality. (The list is short enough that such 
a comprehensive survey is feasible.) 

4.5.1 Trivial Example: Convex Combination of Gibbs Measures for Dif- 
ferent Interactions 

These are perhaps rather silly examples: if one makes a convex combination of Gibbs 
measures for the Ising model at two different temperatures, then it is hardly surprising 
that the resulting measure will not be Gibbsian at all. The proof says roughly that if the 
resulting measure were Gibbsian for some interaction $, then the two original measures 
would also have to be Gibbsian for $. But this is impossible, because the Griffiths- 
Ruelle theorem tells us that a measure can be Gibbsian for at most one interaction 
(modulo physical equivalence). 

We need a preliminary result, concerning the conditions under which a "reweight- 
ing" of a Gibbs measure remains a Gibbs measure: 

Lemma 4.11 Let H be a specification and fi a measure in ^(H). A measure v oj the 
form. V = ffi belongs also to Q{Ii) if and only if f is J-'oo-m.easurable (modulo fi-null 

sets). 



139 



(We recall that Too = f] ^A^ is the cr-field of observables at infinity: see Section 

Aes 
2.3.6.) The proof of this lemma is given, for instance, in [299, Lemma 2.4] and in [157, 

Theorem 7.7]. 

We can now prove the main result: 

Proposition 4.12 Let /ii,/i2, . . . be a finite or countahly infinite family of measures 

(not necessarily translation-invariant) which are are distinguishable at infinity, i.e. 

there exist disjoint sets Fi,F2,--- £ ^oo such that fik{Fk) = 1 for each k. Assum.e 

further that each of the mea.sures /ii,/i2, • • • gives nonzero mea.sure to every open set in 

0. Now form a convex combination fi = Y.'^kf'k with all c^ > 0. If fi is consistent with 

k 
a .specification H, then so are /ii,/i2, • • •/ ^'^^ (f n is Feller, then this is the only Feller 

.specification with which any of these mea.sures is consistent. 

Thus, if some two of the {fik} — say, fii and fij — happen to be consistent with 
different Feller specifications [Hi ^ IIj), then it follows that fi is not consistent with 
any Feller specification. In particular, fi is not a Gibbs measure for any continuous, 
uniformly convergent interaction. If the single-spin space Oq is finite, this means that 
fi is not consistent with any quasilocal specification, and in particular that fi is not a 
Gibbs measure for any uniformly convergent interaction. 

Remark. It is not difficult to show that if the measures /ii,/i2,--- are pairwise 
distinguishable at infinity, then they are jointly distinguishable at infinity in the sense 
of Proposition 4.12. Here it is crucial that we are dealing with a countable family. 

Proof of Proposition 4.12. Suppose that fi is consistent with a specification 
n. Then, by Lemma 4.11, the measures fik = c^ Xf^I^ are also consistent with 11. The 
uniqueness follows from Theorem 2.15. ■ 

In order to apply Proposition 4.12, we need to verify that the measures /ii,/i2, • • • 
are distinguishable at infinity (the support hypothesis is usually trivial to check). One 
easy way to obtain such measures is to recall that distinct ergodic translation-invariant 
measures are distinguishable at infinity (Theorem 2.33 and the remark following it). 
We therefore have: 

Corollary 4.13 Let fii^fi2^ . . . be a finite or countably infinite family of ergodic translation- 
invariant Gibbs mea.sures for interactions $i,$2,--- £ B^ , respectively. Now form a 

convex combination fi = Y.'^kf'k with all c^ > 0. If fi is consistent with a Feller .specifi- 

k 
cation H, then all the interactions $/; mu.st be physically equivalent in the DLR sense 

(and hence also in the Ruelle sense). 

Proof. If //j- = //j, then $i and $j must be physically equivalent in the DLR sense 
(Corollary 2.18). So we can assume without loss of generality that the measures 
/ii,/i2,--- are all distinct. Since distinct ergodic measures are distinguishable at in- 
finity, and Gibbs measures for an absolutely summable interaction always give nonzero 
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measure to every open set, we can apply the preceding proposition to conclude that 
n = n ^=n ^ = .... The rest follows from Theorems 2.17 and 2.42. ■ 

Therefore, (non-trivial) finite or countahly infinite convex combinations of ergodic 
translation-invariant Gihhs measures for non-physically-equivalent interactions cannot 
be Gibbsian; and for finite single-spin space they cannot even be quasilocal. 

4.5.2 Restriction of the Two-Dimensional Ising Model to an Axis 

Schonmann [318] gave another example of a non-Gibbsian measure that can be obtained 
by applying a simple transformation to a well-known Gibbsian measure. He proved 
that if /i_|_ is the "-|-" phase of the two-dimensional Ising model at zero field and at 
any temperature below critical, then its restriction fi+P to the axis {(z,0): z G Z} is a 
non-Gibbsian one-dimensional Ising model. His argument is based on two results: 

Rl) For all temperatures below the critical temperature for the d = 2 Ising model, 
z(/i_P|/i+P) ^0. 

R2) Let cr^ jv denote the spin configuration on the "annulus" {(z,0): n < |z| < N}. 
Then for each n there exists an N[n) such that 

K ■ Wn,N{n) = -1) ^ /^- as n ^ oo (4.78) 

for all Gibbs measures fi of the original model. ^^ As a consequence 

{fi+P){ ■ \(Jn,N(n) = -1) -^ l^'-P as n ^ oo . (4.79) 



Result (4.79) implies that if /U+P is consistent with some quasilocal (= Feller) 
specification, then /i_P must be consistent with that same specification. Heuristically 
this is due to the fact that a measure obtained just by a change in the boundary 
conditions must be a different phase for the same interaction. To see it mathematically, 
let H = (tta) be a quasilocal specification with which /U+P is consistent. Then for each 
set A contained in the interval (— n, n) we have that 

(/i+P)( • \(j'^^N(n) = -1) TTA = {l^+P){ ■ Wn,N{n) = "l) (4-80) 

by property (b) of Definition 2.5; and passing to the limit n — )• oo (since H is Feller) 
we obtain 

(/i_P)7rA = /i_P . (4.81) 

Therefore, if fi+P were a Gibbs measure for some (uniformly convergent) interaction, 
then so would be /i_P. But this contradicts the result (Rl), because Gibbs measures 
for the same (absolutely summable translation-invariant) interaction have zero relative 
entropy density. 



®^This statement easily follows from Schonmann's Lemma 1. 
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Schonmann's restriction P does not fit into the framework considered in Section 
3, because tlie voiume compression factor K is not finite (see Exampie 7 in Section 
3.f). On tlie otlier liand, Sclionmann's proof of non-Gibbsianness seems to be ratlier 
different from our proofs in Sections 4.1-4.3. We sliow liere tliat, nevertlieless, liis 
result can be obtained by following basically the steps discussed in Sections 4.1-4.3 
(although at present we are able to do it only for temperatures low enough). This will 
prove that fi+P is not merely non-Gibbsian, but in fact non-quasilocal. The proof will 
use (R2) but not (Rl). 

In our language, the image spins for this transformation are the spins on the hori- 
zontal line, and the internal spins are all the spins of the plane except those of the line. 
We first notice that Schonmann's result (R2) corresponds exactly to our Step 2: that 
is, (4.78) shows that the annulus [—N^ —n] U [n, N] of image spins selects the phase of 
the internal spins. Physically, this is a kind of wetting phenomenon: imposing — spins 
on a large segment of the axis (of size ~ N) give rise to a droplet of the — phase in a 
neighborhood of the axis, even when the bulk boundary conditions are -/-; as A^ — )• oo 
the width of the droplet grows to infinity, and moreover the left and right droplets join, 
thereby enforcing the — phase throughout the infinite system. 

We sketch now how our Step 1 can be proven via a contour argument, so that 
we obtain the non-quasilocality of the image system without making use of the large- 
deviation estimate (Rl). We consider the origin unfixed from the start (so Step 3 
is superfiuous), and consider cu^ j^j to be an alternating configuration such that the 
neighbors of the origin are of opposite sign: 

/ / A /(-I)' if « > . . 

(^special) («-,0) = j(_l)Hl if^<0 ■ ^^-^^^ 

We shall prove the following: there exists e > such that for all k there exist n[k) and 
N[k) such that 



/i+(cro cr^ 


1 1 

— '^speciab '^n(fc),JV(fc) - 


= +1) 


> 


e > 


(4.83a) 


/i+(cro a[^. 


— '^speciab '^n(fc),JV(fc) - 


= -1) 


< 


-i<» 


(4.83b) 



It is clear that (4.83a) and (4.83b) together imply the non-quasilocality of /U+P, for 
they show that in an arbitrarily small neighborhood of cu' ^^j^j G { — 1,1}^ (namely, 
A4 = W: o' 



^special}) 5 there exist open subsets 






A/'fc,+ = {a: cr^ = cu^pe^iai and cr^^) jv(fc) 


= +1} 


(4.84a) 


Mk- = W- cr'i^k = ^special and cr^^^ jv(fc) 


= -1} 


(4.84b) 



such that the (/i_|_P)-average value of E^_^_p[crQ\{cr'^}x:^o) over Afk,+ (resp. f^k,-) is > e 
(resp. < — e/2). This is incompatible with E^_^_p[crQ\{cr'^}x:^o) having any continuous 
(= quasilocal) version. 
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In order to prove (4.83a) and (4.83b), we shall prove the following intermediate 
result: there exists e > such that 

f^+{M(T[^k = ^'specie.!) > ^ (4-85) 

for all k. This trivially implies (4.83a), by the FKG inequality, for any choice of n and 
N. To see that it also implies (4.83b), we use (4.78) with fi = fi^ and applied to the 
functions 

fk = X{(^'l,k = ^special) (4.86a) 

9k = (To X{(T'l,k = ^special) (4.86b) 

We obtain 

Jim /i+(/fc|a;^jv(n) = -1) = f^-ifk) (4.87a) 

limfi+{gk\(j'^^N{n) = -'^) = f^-{9k) (4.87b) 

Dividing (4.87b) by (4.87a) we get 

Jim ^+{ao\(j[k = ^special, (^n,N(n) = "l) = ^-i(^oK,k = ^special) • (4-88) 

Now, by (4.85) and spin-flip symmetry, the RHS is < — e.^° Therefore, for each k there 
exists an n[k) such that 

/i+(cro|cr^^^ = CUgpeciab '^n(k),N(n(k)) = "l) < "g ' (4.89) 

which is (4.83b). 

So now let us prove (4.85) — at low enough temperatures — by a more-or-less 
standard Peierls argument [169]. Here the contours are defined as the boundaries (in 
the dual lattice) of regions where the spins differ from the ground-state configuration 
(that is, all "-|-" except for the required alternating "— "). In counting the energy of such 
contours one must subtract the energy of the contours already existing in the ground 
state (squares surrounding the alternating "— "). After some thought, one concludes 
that the energy of the contours is at least proportional to Ny -\- hNh — 2, where Ny (resp. 
Nfi) is the the number of vertical (resp. horizontal) bonds in the contour. On the other 
hand, the number of possible contours is even less than that for the unconditioned Ising 
model. As in the standard Peierls argument, these facts imply that the probability of 
finding a contour surrounding the origin — that is, of having a "— " at the origin — 
goes to zero as /3 goes to infinity. 



®'^If we apply spin-flip symmetry to (4.85), we not only change //_|_ to //_ and e to —e, but must also 
change w^ j^j to — i^gpeciai- ^^^ ^^i^ latter is just w^ j^j reflected in the a;2-axis (i.e. xi -^ —xi), and 
the measures //_|_ and //_ are invariant under this reflection. 
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Remarks. 1. In contrast to the RG examples given in Sections 4.1-4.3, here the 
non-Gibbsianness occurs only for interactions on the first-order phase-transition curve, 
i.e. zero magnetic field. Indeed, Maes and van de Velde [256] have proven that if either 
h ^ OT l3 is sufficiently small, the restriction of the two-dimensional Ising model to 
an axis is Gibbsian. 

2. It is natural to generalize this example: consider a J-dimensional Ising model 
and a J'-dimensional coordinate plane [1 < d' < d). It seems to be an open question, 
for all cases other than (J, d') = (2, 1), whether the restricted measure is non-Gibbsian 
at low temperatures. 

4.5.3 Fortuin-Kasteleyn Random-Cluster Model 

In 1972 Fortuin and Kasteleyn [127] introduced a correlated bond-percolation model 
which has since become known as the Fortuin-Kasteleyn random- cluster model. For a 
finite graph G = {V^ B) having vertex set V and edge (or "bond") set i?, the model 
is defined as follows: On each bond b there is a variable nj, taking the value ("bond 
vacant") or 1 ("bond occupied"). The probability of a configuration n = {nj,} is defined 
to be 

Prob(n) = const X p^i(")(l - p)^o(n)^c(n) ^ ^^gg^ 

where < p < 1 and q > are parameters; here A/o(n) [resp. A/i(n)] is the number of 
bonds b with n^ = [resp. n^ = 1], and C(n) is the number of "clusters" (i.e. connected 
components of vertices) in the graph Gu whose vertex set is V and whose edges are the 
occupied (n^ = 1) bonds. For q = I this model reduces to ordinary (independent) bond 
percolation, while for integer q > 2 there are identities relating the random-cluster 
model to the q-state Potts model [127, 125, 109]. 

Let us now try to formulate the random-cluster model on a countably infinite graph 
G = iV,B) [for example, V = Z and B = nearest-neighbor bonds in Z ], following 
the DLR approach. The "lattice" is here B, and the configuration space is {0,1}^. 
Let A be a finite subset of i?, and let A* C V be the set of all vertices touching at 
least one bond 6 G A. We need to specify the conditional probabilities of {n^jfegA given 
{'nb'}b'eB\A- But this is easy, by the same method as for spin systems: we write down 
the formal (meaningless) Boltzmann factor for the infinite lattice, and then drop all 
terms that don't involve {n^lfegA- The result is simple: it is 

Vvoh{{nb}beK\{n,'heB\K) = const({n5.}5'eB\A) x p^^("a)(i _^)AAo(n,)^c,,(n) ^ (49^^ 

where A/o(nA) [resp. A/i(nA)] is the number of bonds 6 G A with n^ = [resp. n^ = 1], 
while CA*(n) is the number of clusters containing at least one element of A* ^ in the 
graph whose edges are the occupied (n^ = 1) bonds (both those inside and outside A). 
It is easy to see that (4.91) defines a specification (i.e. it is consistent for different 
A). It is also easy to see that the dependence on {f^6'}6'g_B\A is only via the set of 
answers to the following questions: for each pair x, j/ G A*, one wants to know whether 
X and y can be connected by a path of occupied bonds lying in i? \ A. Note, however, 
that the answer to this question could depend on bonds n^/ arbitrarily far away from 
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A (provided that the graph G contains arbitrarily large closed loops). Therefore, for 
q ^ 1, the specification defined by (4-91) is not quasilocal (as was previously noted in 

[5]). ^ 

Aizenman, Chayes, Chayes and Newman [5] have proven the existence of the 
infinite- volume limit for the "Gibbs" measures of the random-cluster model taken with 
either free (n^/ = 0) or wired (n^/ = 1) boundary conditions. However, since the specifi- 
cation (4.91) is not quasilocal (hence not Feller), it is not immediate that these limiting 
measures fij and //„, are indeed consistent with the specification (4.91) [since Proposi- 
tion 2.22 does not apply], although it seems very plausible. Indeed, it is not clear that 
there exist any measures consistent with the specification (4.91). We therefore pose 
the following open question: Prove that the infinite- volume limit measures taken with 
free or wired boundary conditions are consistent with the specification (4.91). 

Assuming that there do exist measures consistent with the specification (4.91), we 
can now prove that all these measures are non-quasilocal (hence non-Gibbsian). 

Definition 4.14 Let ft be a metric space. We call a function /: — )• R strongly 
discontinuous if every continuous function differs from f on a set having nonempty 
interior. [In detail: for every g G C(0), the set {u: f{io) ^ 5'(^)} has nonempty 
interior.] 

We call a .specification strongly non- Feller if there exists A G iS and f G C(0) .such 
that tta/ is .strongly discontinuous. 

A sufficient condition for strong discontinuity of a function / is the following: there 
exists an cu* G and an e > such that for every neighborhood Af 3 io* there exist 
open sets A/+,A/'_ C Af such that inf /(cu) — sup f{io) > e. 

It is now easy to prove that the specification (4.91) is strongly non-Feller. To avoid 
uninteresting graph-theoretic complexities, we prove the theorem for the special case 
y = Z and B = nearest-neighbor bonds in Z . The reader can easily generalize this 
to a suitable class of countably infinite graphs G. 

Proposition 4.15 Let q ^ 1. Then the .specification (4-91) for the random- clu.ster 
model is strongly non-Feller, when V = ~L and B = nearest-neighbor bonds in Z . 



Proof. Let A be a set containing a single bond bo = {xc^i}, and let /(n) = n^g. 
Now let id* be the configuration which sets n^ = 1 on parallel rays running from Xq and 
Xi to infinity, perpendicular to the bond 6o, and which sets n^ = on all other bonds. 
Now any neighborhood J\f 3 io* (in the product topology) contains the particular 
neighborhood 

AT^ = {n: n = uj* on Ar} , (4.92) 

where A^^ is the set of all bonds in a square of side 2R -\- 1 centered at the origin. We 
then choose A/i?,+ to be the subset of JVr in which an occupied bond in A^^+i \ A^^ 
connects the two parallel rays; and we choose JVr^- to be the subset of JVr in which all 
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the bonds in A^^+i \ A^^ are vacant (so that the two parallel rays cannot be connected, 
no matter what happens outside A/^+i). It is easy to see that 

p for all id G J^R,+ 



(. p+(l-p)q ^' 

for all R. Since < p < I and q ^ 1^ it follows that T^{bo}f is strongly discontinuous. 



Proposition 4.16 (a) Let 11 6e a strongly non-Feller specification, and let fi be any 

measure consistent with 11 that gives nonzero measure to every open set. Then fi is not 
consistent with any Feller specification. 

(h) Let Hhe a .strongly non-Feller .specification, and as.sume further that 11 is nonnull 
with respect to an a priori mea.sure iJ^ that gives nonzero mea.sure to every open set. 
Let fi be any mea.sure consistent with 11. Then fi is not consistent with any Feller 
.specification. 

Proof, (a) Let A ^ S and / G C(0) be such that tta/ is strongly discontinuous. If 
now n' is a Feller specification, by definition tt^/ is continuous, and therefore differs 
from tta/ on a set having nonempty interior. But since fi gives nonzero measure to 
every open set, tta/ and tt^/ cannot be equal /i-a.e.; fi cannot be consistent with both 
n and n'. 

(b) is an immediate consequence of (a), once we realize that any measure consistent 
with a nonnull specification (Definition 2. II) must give nonzero measure to every open 
set. ■ 

Since the FK specification (4.91) is clearly nonnull (for < p < I and q > 0), we 
conclude: 

Corollary 4.17 Let q ^ 1, and let fi be any mea.sure consistent with the FK .speci- 
fication (4-91) [for V = Z and B = nearest-neighbor bonds in Z J. Then ji is not 
consistent with any Feller (= quasilocal) .specification. 

We note that the method used here to prove non-quasilocality is essentially the same 
as that used in Sections 4.1-4.3 on the RG examples. The only difference is that here we 
are working with an explicit specification, so that we can prove bounds over the whole 
sets A/+ and A/"-; whereas in Sections 4.1-4.3 we were working with the conditional 
probabilities of a given measure //', which are defined only up to modification on //'- 
null sets, and therefore we could only prove the bounds over A/+ and A/"- in the /i'-a.e. 
sense. 

Finally, we remark that for integer <? > 2, there exists a joint model of interacting 
Potts spins and bond occupation variables — that is, a model whose state space is 
{I, . . . , q}^ X {0, 1}^ — whose marginals on the spin and bond variables are the Potts 
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and random-cluster models, respectively [109]. This joint model has local interactions, 
so its specification obviously quasilocal. (The only reason it isn't Gibbsian is that there 
are some exclusions.) The identities relating the joint. Potts and random-cluster models 
are easily proven in finite volume, but they can presumably be made rigorous in infinite 
volume by methods like those sketched in Section 4.2, Step 0. If so, then any Gibbs 
measure of the joint model would produce, upon "decimation" to the bond variables, a 
non-quasilocal measure (namely, a measure consistent with the random-cluster-model 
specification).^^ This would then be another example in which "decimation" of a 
quasilocal measure yields a non-quasilocal measure. 

4.5.4 Stationary Measures in Nonequilibrium Statistical Mechanics 

Consider an infinite- volume lattice system evolving stochastically, in either continuous 
time or discrete time, according to (quasi)local rules which do not satisfy detailed 
balance. Thus, in continuous time we have in mind an interacting particle system 
[251]: for example, a system of the spin-fiip (resp. spin-exchange) type, in which each 
spin fiips (resp. each nearest-neighbor pair of spins exchanges values) independently, at 
Poisson random times, with rates depending in a (quasi)local way on the other spins. 
Examples of such dynamics include: 

(a) The voter model [251]: independently at each site x, at Poisson random times 
the spin ("voter") at x changes its value to that of a randomly chosen neighbor. 

(b) An Ising model with competing dynamics: for example, a mixture of Glauber dy- 
namics for two different temperatures [144], or a mixture of Glauber dynamics for 
one temperature and Kawasaki dynamics for a different temperature [362]. (The 
latter model has been considered by Lebowitz and his collaborators in connection 
with the hydrodynamic limit [238].) 

In discrete time we have in mind a probabilistic cellular automaton (PCA) [162, 240]: 
simultaneously at each clock tick, each spin attempts independently to fiip, again with 
rates depending in a (quasi)local way on the other spins. An example is: 

(c) The Toom model [347, 240]: each spin changes its value, with probability p, to 
the majority of its northern neighbor, its eastern neighbor, and itself, and with 
probabilities (1 —p)/2 to ±1. 

Thus, the PCAs are the discrete-time analogue of the spin-flip interacting particle 
systems. 

Lebowitz and Schonmann [246, p. 50] have argued that in both the continuous-time 
and discrete-time cases, the stationary measure(s) should generally be expected to be 
non-Gibbsian and indeed non-quasilocal: for "systems maintained in a nonequilibrium 



®^This would probably also give a method for proving that fif and //„, are consistent with the 
random-cluster-model specification, at least for integer q. 



147 



state by contacts with outside sources . . . [the measures describing] stationary non- 
equihbrium states cannot be expected to behave in a quasi-Markovian [in our language, 
quasilocal] way — isolating a part [of the system from the rest] will generally change 
its behavior drastically." 

This conjecture has been proven by Lebowitz and Schonmann [246] in the case of the 
voter model. More precisely, they have proven [246, equation (3.8)] that i[6^\pp) = 
where z/p (0 < /? < 1) is an extremal translation-invariant stationary measure of the 



voter model in Z [d > 3). This shows that fp is non-Gibbsian (as remarked also 
in [240]). It is interesting to note that this is the same large-deviations argument 
employed in the Lebowitz-Maes-Dorlas-van Enter examples (Section 4.4). 

Martinelli and Scoppola [259] have given another example of a dynamics in which 
the stationary measure is non-Gibbsian: again the probability of a region in which all 
the spins are -|- decays more slowly than exponentially in the volume of the region, so 
the measure cannot be Gibbsian. However, the Martinelli-Scoppola dynamics is highly 
non-local — it involves collective flips of arbitrarily large clusters — so perhaps the 
non-Gibbsianness is not so surprising. (The Martinelli-Scoppola dynamics superficially 
resembles the Swendsen-Wang [340] dynamics; but in truth the resemblance is not so 
close, since the stationary measure of the former is non-Gibbsian, while the stationary 
measure of the latter is the nearest-neighbor Ising model!) 

Finally, Maes and Redig [255] have described an (anisotropic) local spin- exchange 
dynamics in which the stationary measure is expected to have non-summable long- 
range correlations in the "high-noise" regime (i.e. at what ought to correspond to 
"high temperature"). Such unusual behavior would suggest, though it would not prove, 
that the stationary measure is non-Gibbsian. The long-range spatial correlations are 
indicated in this model by a perturbation calculation, but a more general physical 
intuition seems to be the following: Transport properties for spin-exchange processes 
are diffusive, and the correlation functions are expected to exhibit slow (power-law) 
decay in time ("long-time tails"). Now, one expects spatial and temporal correlations 
to have roughly similar decay — i.e. both exponential or both power-law — except 
in very special cases such as models satisfying detailed balance. This suggests that 
spin-exchange processes not satisfying detailed balance should have, quite generally, 
stationary measures with long-range correlations, and very likely, stationary measures 
that are non-Gibbsian. 

In the PCA models, the probability measure on the space-time histories is the 
Gibbs measure for a [d -\- I)-dimensional lattice model with interactions which can be 
expressed in terms of the transition rules of the PCA model [162, 240]. The station- 
ary measure of the PCA model thus corresponds to the restriction of this space-time 
measure to a J-dimensional (equal-time) hyperplane. When the PCA is in the "high- 
noise" regime — so that the associated [d-\- I)-dimensional equilibrium model is in the 
Dobrushin-Shlosman high-temperature regime — the stationary measure is known to 
be unique and Gibbsian [240]. (A similar theorem has recently been proven also for 
continuous-time spin-flip systems [257].) However, by analogy with the Schonmann 
example (Section 4.5.2), one may suspect that in the "non-ergodic" (phase-transition) 
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regime of the PCA model — where the stationary measure is not unique — each sta- 
tionary measure would typically be non-Gibbsian. In particular, one may conjecture 
that this is so for the Toom model. We thus suspect that Liggett's conjecture [251, p. 
224], to the effect that every translation-invariant finite-range dynamics with strictly 
positive rates has a Gibbsian stationary measure, is most likely false. 

Remark. The foregoing considerations are for rates that do not satisfy detailed 
balance. If the rates satisfy detailed balance, then one expects all the stationary mea- 
sures to be Gibbsian (for an explicit Gibbsian specification that is easy to write down 
given the rates); however, this has not yet been proven rigorously even in the Glauber 
dynamics for the nearest-neighbor Ising model in dimension d > 3 [251, Problem 
IV.7.1]. 

Finally, let us quote a result of Kiinsch [229] for continuous-time local spin-fiip 
processes with strictly positive rates: if there exists a translation-invariant stationary 
measure which is Gibbsian for some (absolutely summable) interaction, then every 
other translation-invariant stationary measure must be Gibbsian for the same interac- 
tion. 

4.5.5 Comparison of Methods for Proving Non-Gibbsianness 

Any theorem of the form "every Gibbs measure has the property "P" provides a method 
for proving non-Gibbsianness via the contrapositive: a measure not having the property 
V must be non-Gibbsian. We have seen four properties of this sort: 

(i) A Gibbs measure (for an absolutely summable interaction) must be uniformly 
non-null. This is a consequence of the "easy half" of the Gibbs representation 
theorem [Theorem 2.12 (a) =^ (b)]. 

(ii) A Gibbs measure (for a uniformly convergent interaction) must be quasilocal 
[Theorem 2.10]. 

(iii) A measure can be Gibbsian for at most one (uniformly convergent, continuous) 
interaction, up to "physical equivalence" [Corollary 2.18]. 

(iv) Translation-invariant Gibbs measures (for translation-invariant absolutely summable 
interactions) have "good" large-deviation properties: the probability that spins in 
a certain region fiuctuate into a configuration characteristic of another translation- 
invariant measure decreases exponentially in the volume of the region, except if 
this other measure is also Gibbsian for the same (absolutely summable) interac- 
tion. In precise mathematical terms: a translation-invariant measure fi has zero 
relative entropy density respect to another translation-invariant measure p which 
is Gibbsian for an interaction $, if and only if fi is also Gibbsian for the same 
interaction $. This is one of the consequences of the discussion of Section 2.6.6. 
It is also closely related to the strict convexity of the pressure [Proposition 2.59]. 

For each of these conditions, we have seen examples in which the non-Gibbsianness 
is proven by its violation: 
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(i) Lack of uniform nonnullness. This has two manifestations: A measure can 
be nonnuU but not uniformly so (see Definition 2.11). This typically means that the 
Hamiltonians are unbounded functions and one cannot use the formalism developed 
for absolutely summable interactions. This is the generic situation for unbounded- 
spin models, and it gets delicate for infinite-range interactions. In these cases, often 
the notion of Gibbsianness can be preserved if one excludes "by hand" problematic 
configurations [245, 60]. On the other hand, the measure may fail to be nonnuU, 
which means that some cylinder sets have zero measure. This is a rather simple case 
of non-Gibbsianness in which the Gibbsianness can be restored by allowing hard-core 
interactions or working on a more restricted configuration space (see for example [313]). 
We mention that, in the setting of complex interactions, there are examples of Gibbsian 
measures that after one renormalization step remain quasilocal but lose nonnullness 

[14].^^ 

(ii) Violation of quasilocality. Most of the cases of pathological renormalization 
transformations analyzed above (Sections 4.1-4.3 and 4.5.2) fall into this category. This 
phenomenon appears when there are some "hidden spins" that transmit information 
from arbitrarily far away even if the "non-hidden" spins are fixed. In the renormal- 
ization transformations the "hidden variables" are the fiuctuations of the original or 
internal spins that remain once the block spins are fixed. In Schonmann's example 
(Section 4.5.2), the "hidden variables" are all the spins outside the x-axis, which are 
"hidden" by the process of restriction. 

(iii) Threatened violation of uniqueness. We used this method to study the "trivial" 
examples of non-Gibbsianness discussed in Section 4.5.1. Consider a finite or countable 
family of different (non-physically-equivalent) interactions and pick for each one an er- 
godic translation-invariant Gibbs measure. Then a nontrivial convex combination of 
these measures cannot be Gibbsian for any (uniformly convergent, continuous) interac- 
tion, because if it were, then each of the original measures would be a Gibbs measure 
also for this new interaction, violating uniqueness. In the case of a finite single-spin 
space, this method also proves non-quasilocality. 

(iv) Wrong large deviation properties. There seem to be two rather different types 
of "bad" large-deviation properties: 

(a) Sub-exponential decay for events whose probability "should" decay exponentially 
in the volume. This applies to the sign field of the (an)harmonic crystal (Section 
4.4), and the stationary measures for the voter and Martinelli-Scoppola models 
mentioned in Section 4.5.4. Here one shows that the probability of all the spins in 
a large region becoming -|- decays sub-exponentially in the volume of the region; 
this is incompatible with being Gibbsian for any absolutely summable interaction. 
In other words, one shows that the measure fi satisfies i[6^\fi) = 0, where 6^ is 
the delta-measure concentrated on the all--|- configuration. As this measure is 
obviously non-Gibbsian (it is not nonnuU!), neither is //. 

(/3) Exponential decay for events whose probability "should" decay sub-exponentially. 
The original proof of Schonmann's example [318] is based on an argument of this 
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kind. Here one shows that, in the + phase, the probabihty of having a net 
negative magnetization in a large region decays exponentially in the volume of 
the region. In other words, the measures obtained via "+" and "— " boundary 
conditions have a strictly positive relative entropy. If either of these measures 
were Gibbsian (for an absolutely summable interaction), the other would have 
to be Gibbsian for the same interaction (because they differ only by boundary 
conditions); but then the relative entropy would have to be zero (Theorem 2.66). 
Therefore, they cannot be Gibbsian. 

Often we would like to prove not only that a measure is non-Gibbsian, but also that 
it is non-quasilocal (which is stronger). In nearly all cases we have done this "by hand": 
that is, by proving bounds on the conditional probabilities which are incompatible with 
their having any quasilocal version (see Sections 4.1-4.3 and 4.5.2). In only one case 
were we able to prove non-quasilocality by an abstract "trick": this was the "trivial" 
convex-combination example (Section 4.5.1), where we used method (iii) above. It 
would be interesting to have available other methods for proving non-quasilocality. 

4.5.6 Are "Most" Measures Non-Gibbsian? 

The traditional belief among physicists (including ourselves until recently) is that all 
(or nearly all) physically interesting measures are Gibbsian. Indeed, this belief is so 
much taken for granted^^ that it is rarely stated explicitly.^"^ The profound message of 
Israel's pioneering work [207], and of the examples given here, is that this traditional 
belief is false: many physically interesting measures are non-Gibbsian. In fact, we now 
suspect that Gibbsianness should be considered to be the exception rather than the 
rule — that, in some sense, m.ost measures are non-Gibbsian. 

It is therefore of at least mathematical interest to study the set Q = \J Qinvi^^) 

of all translation-invariant measures which are Gibbsian for some translation-invariant 
absolutely summable continuous interaction. Is ^ a "big" or a "small" subset of the 
space M^i^inv{^) of all translation-invariant measures? 



^^There are many examples of this in the physics literature: see, for example, [54, 49]. 

®^One exception is the recent statement by a noted mathematical physicist that "every good ran- 
dom field is Gibbsian" [324]. In a similar vein, a mathematician says: "the Gibbsian form of local 
conditional distributions is a rather weak condition, but it is difficult to check it." [229, p. 410] A 
related though somewhat weaker intuition can be found in a well-known monograph on interacting 
particle systems: "Is it true that every translation invariant strictly positive spin system on Z with 
finite range has an invariant measure which is a Gibbs state? This is plausible . . . [because] the strict 
positivity of the rates should imply that an invariant measure is somewhat smooth." [251, p. 224] 
On this same conjecture, another mathematician says: "We couldn't prove in general the existence 
of a stationary Gibbs measure, although this is very likely to hold." [229, p. 408] As discussed in 
Section 4.5.4, this conjecture is still an open problem, but there is good reason to suspect that it is 
false. (These examples, together with those of the preceding footnote, illustrate the difference between 
physicists and mathematicians: both often have erroneous intuitions, but the mathematicians state 
them explicitly.) 
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It is a "big" set in a very weak sense, namely that of being dense in the weak topol- 
ogy. In fact, the Gibbs measures for finite-range continuous interactions are already 
dense: 

Proposition 4.18 Assume that the single-spin space Oq is a compact m.etric space, 
and that the a priori single-spin m,easure fi^ gives nonzero measure to every open set 
offto. Then 

Q finite = U ^m-u(n ) 

is dense in Af_|_i^m^,(0) in the weak topology. 

Proof. The proof goes in three steps: First, the ergodic measures of finite entropy 
density (relative to /i°) are dense in Af_|_i^m^,(0) [Proposition 2.61(e)]. Secondly, Israel 
[206] has shown, using the Bishop-Phelps theorem, that each ergodic measure of finite 
entropy density is an (extremal) equilibrium measure for some interaction $* G S° 
(see item 2 in Section 2.6.7). Finally, the finite-range interactions form a dense subset 
B finite C S°; and it follows from a theorem of Lanford and Robinson [231] (see also Sokal 
[332]) that every extremal equilibrium measure for $* G B^ can be approximated in the 
weak topology by equilibrium measures for interactions $„ G B finite with ||$ra — $*||bo -^ 
0. ■ 

We emphasize that density in the weak topology is an extremely weak property: it 
means only that an arbitrary measure // G Af_|_i^m^,(0) can be approximated arbitrary 
closely, with regard to any finite family of local observables, by a measure in Q finite- 
In particular, the long-range-order properties of the approximating measures can be 
totally different from those of the limiting measure //. Thus, Proposition 4.18 is very 
far from saying that "most" measures are Gibbsian. 

In a more profound sense we expect that Q is in fact a rather "small" subset of 
Mj^\^inv{^)- For example, we conjecture: 

Conjecture 4.19 (a) Q is a set of first Baire category in Af_|_i^m^,(0). [That is, Q is 
a countable union of sets which are nowhere dense in M^i^inv{^)-J 

(h) Q n exAf_|_i^m^,(0) is a set of first Baire category in exAf_|_i^m^,(0). [Here "ex" 
denotes the extreme points, i.e. the ergodic mea.sures.] 

First Baire category is a classic notion of "smallness" in topology [285]. 
We can make some small steps toward proving Conjecture 4.19(a): 

Proposition 4.20 Q has empty interior. 

Proposition 4.21 As.sume that the single-spin space Oq is a compact metric .space, 
and that the a priori single-spin mea.sure fi^ gives nonzero mea.sure to every open set 
offto. Let S he a compact .subset of B^ , and let £s he the set of equilibrium. mea.sures 
for interactions in S. Then £s is a compact .subset o/ Af_|_i^m^,(0). 
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Corollary 4.22 Assume that the single-spin space Oq is a compact m.etric space, and 
that the a priori single-spin measure /i° gives nonzero measure to every open set of 
Oq. If S is a cr-compact subset of B^ , with S C B^ , then Ss = Qs = U Qinvi^^) 

is a-compact and of first Baire category in Af_|_i^m^,(0). In particular, this occurs for 
S = Bh with h^ 1. 

Proof of Proposition 4.20. Let // G ^ and v G M+i^^nv\G- Then, by Proposition 
2.48(b), {l-X)ji + Xp ^ g foT 0< X < I. But {1-X)ji + Xp^ // weakly as A i 0. Hence 
Q cannot contain any open neighborhood of //. [In this proof we could equally well have 
taken ;/ to be a Gibbs measure for an interaction not physically equivalent to the one 
for which fi is Gibbsian, and then apply Proposition 2.48(b) and the Griffiths-Ruelle 
theorem.! ■ 



Proof of Proposition 4.21. M+i^j„^,(0) is compact, so we need only show that 
£s is closed. Let //„ be an equilibrium measure for $„ G S*, with fin ^^ f- weakly. Then, 
since S is compact, there exists a subsequence $„^ that converges (in S° norm) to some 
$ G S*. But then fi is an equilibrium measure for $. ■ 

Proof of Proposition 4.22. The first statement is an immediate consequence 
of Propositions 4.20 and 4.21. The second statement follows from Proposition 2.39(b). 



We thank S.R.S. Varadhan for suggesting these latter results and sketching the 
proofs. 



5 Discussion 

5.1 Numerically Observed Discontinuities of the RG Map 

5.1.1 Statement of the Problem 

In several Monte Carlo renormalization group (MCRG) studies [33, 233, 72, 163], it has 
been found that the numerically computed renormalization transformation TZ: H \-^ H' 
is discontinuous at a first-order phase-transition surface. ^"^ However, this behavior is 



^■^The models in which this behavior has been (at least tentatively) observed include the two- 
dimensional Ising model at low temperature [72], the 10-state Potts model in two dimensions [72], the 
3-state Potts model in three dimensions [33], the Z2 lattice gauge theory in four dimensions [163] and 
the U(l) lattice gauge theory in four dimensions [233, 72]. However, in a more recent study of the two- 
dimensional Ising model at low temperature [165], the observed discontinuity was always less than 
the estimated truncation error, and it decreased as more terms were included in the renormalized 
Hamiltonian; this was interpreted as evidence against a discontinuity in the exact renormalization 
map. 
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rigorously excluded by our Second Fundamental Theorem (Theorem 3.6). In this sec- 
tion we would like to offer our interpretation of the numerically observed discontinuities. 

A MCRG study [337, 338] proceeds as follows: We choose an original Hamiltonian 
i7, and generate a long sequence of random samples cui , c<;2 , . . . from the Gibbs measure 
/i = const X e~^ using some Monte Carlo procedure. On each of these "original-spin" 
configurations coi we apply the renormalization map T to generate the corresponding 
block-spin configuration lo'-. In this way we have generated a random sample (jj[^(jJ2^ ■ ■ ■ 
from the renormalized measure //' = fiT. It is now assumed that //' is the Gibbs 
measure for some renormalized Hamiltonian H' belonging to a fixed finite-parameter 
family -ff(Ai, . . . , Ajv), and some statistical method [339, 164, 9] is employed to estimate 
the unknown parameters Ai, . . . , Ajy. 

Such a procedure has three sources of error: 

1) Statistical error arising from the finite Monte Carlo sample. 

2) Systematic error arising from the finite lattice size. (We take the point of view 
that our goal is to learn about the behavior of the infinite-volume system.) 

3) Systematic error arising from truncation of the renormalized Hamiltonian: //' may 
not be (in fact, in almost all cases is not) a Gibbs measure for any Hamiltonian 
in the assumed A^-parameter family. We include here the possibility — studied 
in detail in Section 4 — that //' is not the Gibbs measure for any reasonable 
Hamiltonian. 

It is useful to study these three sources of error separately . In particular, we would like 
to study the problem of truncation of the renormalized Hamiltonian, independently of 
the problems of statistical and finite-size errors. Therefore, we begin by formulating 
an idealized model of the parameter-estimation problem in which we assume that the 
experimenter knows exactly the expectation values of an appropriate set of observables 
(to be specified later) in the infinite-volume renormalized measure //'. This idealized 
situation can be approximated to arbitrary accuracy with sufficient computer time, 
by making long Monte Carlo runs on large systems. (In principle we should then 
discuss the stability of our theory relative to small statistical or finite-size errors. But 
we feel that our considerations are still too preliminary to justify entering into such 
technicalities.) 

5.1.2 An Idealized Model of Parameter Estimation 

Let us first consider the parameter-estimation problem in a general probabilistic (= 
statistical-mechanical) context, without regard (for the moment) to the renormalization- 
group application. Let, therefore, (0,JF) be an arbitrary measurable space, let F be 
some family of probability measures on (0,JF), and let p be another probability mea- 
sure on (0,JF). We wish to find the measure in F which is in some sense "closest to" 
(or "best approximates") p. How should we define "closeness"? Any definition is, of 
course, somewhat arbitrary, but we claim that the following definition is very natural: 
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The measure in F closest to /?, denoted /? , is the one which minimizes 
the relative entropy I[p\ ■ ), assuming that a minimizer with finite relative 
entropy exists (it may or may not be unique). 

Note that the unknown measure is taken here as the reference measure (second ar- 
gument) in the relative entropy.^^ In support of this definition, we cite the following 
properties: 

1) If /O G F, then p = p, uniquely. This is a rather trivial property, but it is at 
least a necessary condition for any reasonable definition of "closeness". 

2) Suppose that one generates a large random sample from /?, and constructs 
maximum-likelihood estimates [325] based on the (false) assumption that the sam- 
ple arose from some measure in F. In the large-sample limit, this maximum-likelihood 
estimate will converge to p . This can be proven under suitable technical hypotheses 
[196], but it is easy to see intuitively why it is true: the relative entropy I{p\i^) is, up 
to an additive constant, precisely minus the mean (under the true measure p) of the 
log likelihood function: 

I{pW) = J dp log ^ 

= const —dp log "Jz/" , (5-1) 



so maximizing the likelihood is equivalent to minimizing the relative entropy. Thus, 
p is the estimate that would be generated by an experimenter possessing an infinite 
random sample from p and using the optimal estimation method (namely, maximum 
likelihood). ^^ (Note also that the maximum-likelihood estimate for a finite sample 
cui , . . . , cUn is the measure in F closest to the empirical measure Ln = n-~^ Yl^=i ^tu, 
for the given sample. This close relation between maximum-likelihood estimation and 
relative entropy has been noticed by previous authors.) 

Suppose now that the set F is of the Boltzmann-Gibbs form (= exponential family) 



ux = Z{\) exp 



N 



z2 ^«-^« 



8 = 1 



/: AgR^I (5.2) 



for some specified family i^i, . . . , H^ and a priori measure /i°. We can assume without 
loss of generality that the functions 1,-ffi, . . . , -ff at are linearly independent (/i°-a.e.). 



®®This follows Cencov [61]. (Note, however, that Cencov's notation for the arguments of 7( • | • ) is 
the reverse of ours.) By contrast, Csiszar [68, 69] considers the quite different problem in which the 
unknown measure is the first argument in the relative entropy. 

®®This assertion is perhaps somewhat misleading: The maximum-likelihood method is optimal 
as regards statistical errors (in the large-sample limit) [325], while here we are concerned with the 
systematic errors due to truncation. Indeed, the problem here is to define what we mean by the 
"optimal" truncation. In any case, we claim that maximum-likelihood estimation is a sensible idealized 
model of what a good experimenter would actually do if he/she could. 
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Then the relative entropy 



N 

I{p\iyx) = logZ(A) + J2X,jH,dp + const (5.3) 

8 = 1 

is a strictly convex function of A; in particular, the measure p is unique if it exists at 
alP^, and it is defined by the conditions 

{H,),, = {H,)p foralH = l,...,iV. (5.4) 



The foregoing theory is adequate for parameter estimation in finite-volume systems; 
but for infinite-volume systems it is inapplicable, because the relative entropy of two 
translation-invariant measures is in nearly all cases -|-oo. The problem here is that, as 
discussed in Section 2.6, the relative entropy in volume A typically grows proportionally 
to the volume (unless the two measures happen to be Gibbs measures for the same 
interaction). This volume factor is uninteresting in the present context, because it does 
not depend on the parameters A over which we want to optimize. Therefore, it makes 
sense to just divide out this volume factor and minimize the relative entropy density. 
That is, if F C Af_|_i^m^,(0) and p G Af_|_i^m^,(0), we define: 

The measure in F closest to /?, denoted /? , is the one which minimizes 
the relative entropy density z(/j|-), assuming that these relative entropy 
densities are well-defined and that a minimizer with finite relative entropy 
density exists (it may or may not be unique). 

Suppose now that F is the set of translation-invariant Gibbs measures for interac- 
tions $ G V (and a priori measure /i°), where V = span($i, . . . , $jv) is some specified 
finite-dimensional linear subspace of B^ . Then, from (2.96) we have 

i{p\v) = p[-f^\/j,^) + J Udp + i{p\p°) 

= Fp{^) (5.5) 

whenever ;/ is a Gibbs measure for $. Thus, i{p\i^) depends on p only via the interaction 
$; in fact, Fp[^) is precisely the amount by which the pair (/O, $) fails to satisfy the 
variational principle. Therefore, if one Gibbs measure for $ happens to minimize 
z(/j| • ), then all Gibbs measures for $ do so. So the measure in F closest to p may not 
be unique. Nevertheless, the corresponding interaction is necessarily unique (modulo 
physical equivalence) if it exists at alP®, because Fp is strictly convex on V C B^ 



®^The minimizer p could fail to exist: Consider, for example, il = { — 1, 1}, /U*^ = k{^-i + ^+i)j 
p = 6-1, N = 1 and Hi(ui) = w. Then the minimum is "at A = -|-cxd"; there is no minimizer at finite 
A. 

®® Again, a minimizer could fail to exist, if the minimum is "at infinity". This occurs, for example, 
if /9 is a ground-state measure for some interaction $ G T/: then Fp(p<^) ^ as /3 ^ -|-cxd. 
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(Proposition 2.59). This is good, because it is after all the interaction that we would 
like to estimate. Now, an interaction $* minimizes Fp\V if and only if /ot/[y] is a 
tangent functional at /$* to the pressure restricted to f[V]. [Here f[V] denotes the 
image of V under the map ^ i— )• /.j; it is a linear subspace of C(0).] Now, by the Hahn- 
Banach theorem^^, every tangent functional to pt/[^] can be extended to a tangent 
functional to p, i.e. to an equilibrium (= Gibbs) measure. It follows that an interaction 
$* minimizes Fp \V if and only if there exists a translation-invariant Gibbs measure p 
for $* such that 

(/$.). = {U^)p foralH = l,...,iV. (5.6) 

What happens if we consider larger and larger subspaces of interactions? Let Vi C 
^2 C ... be an increasing sequence of finite-dimensional linear subspaces of B^^ whose 
union is dense in B^. Let $* be the interaction in Vn that minimizes Fp\Vn- Then it 
is natural to conjecture the following: 

Conjecture 5.1 (i) If p is a Gibbs measure for some interaction $ G B^ , then 
$*—)•$ in B^ norm, as n — )• oo. 

(a) If p is not a Gibbs measure for any interaction in B^ , then ||$*||bi — > oo as 
n — )• oo. 

We are not able to prove this much (and we suspect that it may not be true without 
additional hypotheses). Regarding conjecture (i), what we can prove is the following: 

Proposition 5.2 Let p be an ergodic translation-invariant measure of finite entropy 
density (relative to p^ ); and let V\ d V2 d ■ ■ ■ be an increasing sequence of subsets of 
B^ , whose union is dense in B^ . Then: 

(a) There exists an interaction $ G S° (not necessarily in B^ !) for which p is an 
equilibrium, measure. 

(b) Let $ be any interaction in B^ for which p is an equilibrium, measure. Then there 
exists a sequence $„ G Ki which converges to $ in B^ norm. If, in addition, $ 
belongs to some .space Bh C B^ and [J'^_-^Vn is dense in Bh (in Bh norm.), then 
there exists a sequence $„ G Ki which converges to $ in Bh norm.. 

(c) Let $ be any interaction in B^ for which p is an equilibrium. mea.sure, and let {^n) 
be any sequence converging to $ in B^ norm. Then Fp[^n) -^ 0. [In particular, 
we have lim inf Fo(^) = 0./ 

(d) Conversely, let {^n) be any sequence for which Fp($„) — )• and which converges 
in B^ norm, say to $00 • Then p is an equilibrium. mea.sure for $00 • In particular, 
(f(||$„||Bi) is bounded, then $00 £ B^ and p is a Gibbs mea.sure for $oo- 



®^The required version of the Hahn-Banach theorem [313, p. 157, A. 3. 2] can be deduced easily from 
the separating-hyperplane version ([315, p. 46, Theorem II. 3.1] or [310, p. 58, Theorem 3.4(a)]) by 
considering epigraphs. 
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This shows that if /? is a Gibbs measure for $ G S^, then there exists a sequence 
$ra £ Ki of approximate minimizers which converges (in B^ norm) to $. Unfortunately, 
there is no guarantee that the exact minimizers $* (if such exist) converge to $; they 
might fail to converge, or they might converge instead to some interaction $ G S° \ S^ 
for which p is an equilibrium (but not Gibbs!) measure. It is an important open problem 
to find conditions under which conjecture (i), or something like it, can be proven. 

Remark. It is certainly possible for a sequence $„ of approximate minimizers of 
Fp to fail to converge even when p is a Gibbs measure for some interaction in B^. 
Consider, for example, an Ising model: take p = p^ = product measure, and let $„ 
be a ferromagnetic two-body interaction n~'^ f{n~^{x — y))ax(Jy^ where / is some fixed 
nonnegative smooth function with < Jf{x)d'^x < 1. In such a situation, Fp($„) = 
Ff,„{^n) = p{-f^ |/«°)- But then F^o((|„) ^ by the Lebowitz-Penrose theorem [241] 
[345, Appendix C], which tells us that the so-called "Kac limit" lim^^oo p(— /$„ |/W°) 
is the high-temperature mean-field pressure, which is in our normalization. On the 
other hand it is obvious that po is the Gibbs measure for the absolutely summable 
interaction $ = 0, and that nevertheless the interactions $„ do not converge in S° or 
any of its subspaces Bh- [Moreover, the measures pn defined by the interactions $„ 
converge to the product measure po.] We fear that something similar could happen 
also for the exact minimizers $*, unless the spaces Vn are very carefully chosen. 

On the other hand, we can almost prove conjecture (ii): 

Proposition 5.3 Let h: S ^ [I7O0) be a translation-invariant weight function, and 
let p he a translation-invariant measure which is not an equilibrium, measure for any 
interaction in Bh- Let {^n) be any sequence in Bh for which Fp($„) — )• 0. Then at least 
one (and possibly both) of the following two statements is true: 

(a) lim ||$n||Bfe = 00. 

(b) ($ra) does not converge in B^ norm. 

Moreover, if the single-spin space Oq is finite and h^ 1, then statement (a) is always 
true. 

This comes very close to proving conjecture (ii): the only possible escape clause 
is that the sequence ($ra) might have no limit at all (even in B^ norm), even though 
(||$„||bi) is bounded. This can happen, for example, if the interactions become longer- 
and-longer-ranged but with bounded total strength.™ In such a case one must have 
ll^rallBfe — > 00 in every space Bh of "short-range interactions" (Definition 2.38), e.g. 
h{X) = diam(X)^ with e > 0. 



^"^This situation is reminiscent of "mean-field-like" interactions. However, in such situations one 
usually expects (and in some cases can prove, as in the previous remark) that the limiting measure p 
ts Gibbsian for some interaction $ G Z?^ which has picked up a magnetic field. We wish to thank Bob 
Griffiths and Bob Swendsen for a discussion of this point. 
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Proof of Proposition 5.2. (a) is a special case of a theorem of Israel [206, 
Theorem V. 2. 2(a)]. (b) follows from the density of U^^iVn in S° (or Bh)- (c) follows 
from the (Lipschitz) continuity of the function Fp in S° norm, (d) is also a consequence 
of the continuity of Fp, since the hypotheses imply that -F'p($oo) = 0. The last statement 
is a consequence of Proposition 2.39(a) applied to Bh = B^. ■ 



Proof of Proposition 5.3. Suppose that ($„) converges in B° norm to (loo- 
Then -F'p($oo) = 0, so /? is an equilibrium measure for $oo- By hypothesis this means 
that $oo ^ Bh^ i.e. H^ooHb^ = oo. Now assume that ||$ra||Bh 7^ 00; then there is a 
subsequence of ($ra) on which the Bh norm is bounded, say by M; but by Proposition 
2.39(a) this implies that H^ooHb^ < M^ a contradiction. This proves that either (a) or 
(b) [or both] must be true. 

Finally, suppose that the single-spin space is finite, that /i ^ 1, and that there is a 
subsequence of ($ra) on which the Bh norm is bounded, say by M. Then by Proposition 
2.39(b) there exists a sub-subsequence which converges in S° norm to some $00 with 
ll^oollBfe ^ M; and p is an equilibrium measure for $00! but this contradicts the 
hypothesis of the proposition. ■ 

Remarks. 1. Similar ideas appear in the work of Hugenholtz [199]. 

2. A partially alternate proof of the second half of Proposition 5.3, when p is 
ergodic (or a finite convex combination of ergodic measures), goes as follows: By the 
Bishop-Phelps theorem [206, Corollary V.2.1] there exists $„ G S° with ||$ra — ^nllB" ^ 

CpFp[^n) such that p is an equilibrium measure for $„ [if p = Y, (^iPi is the ergodic 
decomposition of /?, then Cp = (2 min ctj)"^]. Now assume that ||$ra||Bh 7^ 00, so that 

l<«'<?Ti 

there is a subsequence of ($ra) on which the Bh norm is bounded, say by M . We have 
then shown that there exist interactions $„ in S°, arbitrarily close to {$: II^Hb^ ^ ^l? 
for which p is an equilibrium measure. When this S/j-ball is compact in B^ (i.e. Oq finite 
and /i ^ 1), then it is easy to show that there exists an interaction in this ball for which 
p is an equilibrium measure [we've done it above, by extracting a sub-subsequence of 
($„) convergent to $00! for what it's worth, the corresponding sub-subsequence of ($ra) 
also converges to $oo]- Unfortunately, if the S/j-ball is not compact, we do not see any 
way to conclude that there exists an interaction in the ball (or even in Bh) for which 
p is an equilibrium measure. So this method still does not suffice to prove conjecture 



5.1.3 Application to the Renormalization Group 

Now let us apply these ideas to the renormalization group, by taking p to be the 
renormalized measure pf . We assume that the experimenter uses the scheme described 
in the previous section to construct an estimated renormalized interaction $^ G Ki. 
We continue to ignore statistical and finite-size errors. 
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The expected behavior of the estimates $^ depends critically on whether //' is 
Gibbsian or non-Gibbsian. Assuming the validity of Conjecture 5.1 (or something like 
it), we have the following scenario: 

Case (i): fi' is Gibbsian for $' G S^. Then we expect the estimated renormalized 
interactions $^ to converge in B^ norm to $'. Now, by the First Fundamental Theorem 
(Theorem 3.4), the renormalized measures arising from distinct phases of the original 
model must be Gibbsian for the same interaction $'. Therefore, any observed multi- 
valuedness of the RG map must disappear asymptotically as the assumed interaction 
space Vn grows. 

Case (a): fi' is non-Gibbsian. In this case we expect the estimated renormalized 
interactions $^ to diverge in B^ norm, i.e. ||$^||bi ~^ oo- This behavior is almost 
rigorously proven (Proposition 5.3). 

This dichotomy provides, at least in principle, a clear method for distinguishing 
experimentally the Gibbsianness or non-Gibbsianness of the renormalized measure //'. 
Whether it will work in practice is less clear: the proofs of non-Gibbsianness in Section 
4 (and of the Fundamental Theorems in Section 3) involve extremely rare events in large 
volumes; so the distinction between Gibbsianness and non-Gibbsianness might turn out 
to be visible only with extremely high statistics and when using an extremely large 
space of renormalized interactions (that is, including interactions involving many spins 
simultaneously). On the other hand, it is at least conceivable that this dependence on 
rare events is an artifact of the proof and not of the result. It would be interesting, 
therefore, to perform a high-precision MCRG test, using a large space of renormalized 
interactions, to compare a case in which //' is expected to be Gibbsian (e.g. the d = 2 
Ising model at a temperature not too far below critical) with a case in which //' is 
expected or proven to be non-Gibbsian (e.g. the d = 2 Ising model at low temperature). 
We are somewhat pessimistic about whether the asymptotic (n — )• oo) behavior can be 
seen with any currently feasible expenditure of resources, but it cannot hurt to try. 

In the existing MCRG studies, the interaction space V is usually taken to be quite 
small: typically 1 < dimy < 10. Can we explain the observed discontinuity of the 
RG map as an artifact of this truncation to a small space of interactions? In our 
opinion the answer is yes. Note first that the estimated renormalized interaction $' is, 
according to (5.4)/(5.6), just a proxy for the renormalized expectation values (/ij)^', 
^ G V. These latter expectation values are, of course, discontinuous at a first-order 
phase-transition surface (and multi-valued on that surface). That in itself does not 
imply the discontinuity and multi-valuedness of $', because the map from interactions 
to expectation values is itself discontinuous and multi-valued. However, for most renor- 
malization transformations we expect the renormalized expectation values to be m.ore 
discontinuous than the original expectation values; and it is far from clear that this 
larger discontinuity can be realized, simultaneously for all observables fq, (^ G V), at 
any interaction in the given space V. If it cannot, then the RG map on the space of 
interactions will appear to be discontinuous. 

Consider, for example, an Ising model at h = and /3 > /3c, and use the majority- 
rule transformation. Then the renormalized magnetization M' will undoubtedly be 
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larger than the original magnetization M = Af(/3,0~'~), since minorities tend to get 
outvoted. Now suppose that one is using, as in the work of Decker, Hasenfratz and 
Hasenfratz [72, Section 4], only a single renormalized coupling h' ^ with /3' fixed to equal 
/3. (That is, V is a one-dimensional affine subspace.) Then one will inevitably find a 
renormalized coupling h' > for the image measure /i'_j_ (resp. h' < for //'_), since only 
in this way can one account for a renormalized magnetization M' = M[l3^ h') > M . 
Decker et al. do recognize this objection, and try to argue that allowing fi' to vary 
would not produce an effect large enough to account for the observed discontinuity in 
h\ but we do not find their argument convincing. 

The situation is more subtle if one considers the two-dimensional space of couplings 
fi' and h' . Then one has to choose the pair (/3', h') so as to match the observed 
renormalized magnetization M' and the observed renormalized energy E' . To do this, 
let us determine first the unique value /3^ such that the renormalized energy can be 
matched at zero magnetic field, i.e. E' = E{fil, 0). Then we ask how the renormalized 
magnetization M' compares to the spontaneous magnetization at /3^: 

(a) If M' > Af(/3^, 0"*"), then it is impossible to match both M' and E' at zero 
magnetic field. Therefore, the renormalized coupling h' will be found to be > 
(resp. < 0) for the image measure /i'_j_ (resp. //'_). 

(b) If M' < M{^1, 0+), then M' and E' can be matched by taking /3 = ^l, h' = 0. [If 
M' < M[l3l^ 0"*"), this entails using a m,ixed phase v in (5.6), but that is perfectly 
legitimate. It corresponds to the minimum of Ep \V occurring at a point of non- 
differentiability.] 

We are unable to decide a priori between these two possibilities; it seems to be a 
detailed dynamical question. 

One approach is to compute the low-temperature expansion of M' and £", and 
compare them to the corresponding expansions for £'(/3,0) and Af(/3,0). This would 
answer the question at sufficiently low temperature. We are indebted to Jesiis Salas 
[314] for performing this computation, for the two-dimensional Ising model at /i = 
using majority rule on 2 X 2 blocks (with a random tie-breaker). Setting u = e~^^\ 
M = (ctq) and E = (crocr(i^o))7 Salas finds: 

M'(m,0+) = I - 4^3 - 32m^ + O(m^) (5.7a) 

^'(u,0) = I - 8m^ - 63m^ + 0{v^) (5.7b) 

These are to be compared with the well-known results 

M(m,0+) = I - 2^2 - 8m^ - 34m^ + 0{v^) (5.8a) 

^(u,0) = I - 4^2 _ 12^3 - 36m^ + O(m^) (5.8b) 

Matching the energies, we find 
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Plugging this into Af , we have 

M«,0+) = 1 - 4^3 - —u* - 472^9/2 ^ ^(^5^ ^ ^5_^0^ 

which is equal to Af'(M,0~'~) at leading order and greater than Af'(M,0~'~) at order u"^. 
We conclude that at low temperature M' and £" can be matched at /i = in the 
two-dimensional Ising model with the 2x2 majority-rule transformation. However, we 
do not know what will happen with other transformations. 

Similar remarks apply in the case of higher-dimensional interaction spaces V. While 
we are unable to prove that a discontinuity will inevitably be observed, neither do we 
see any reason to believe that the renormalized expectation values (/<j)^' can always 
be matched, simultaneously for all ^ G V and all phases //', by some interaction 
$' G V. Therefore, we must expect that typically the observed RG map will be 
multi-valued and discontinuous at a first-order phase-transition surface, purely as an 
artifact of the truncation of the renormalized interaction. Of course, if the image 
measure //' is Gibbsian, then this discontinuity should go to zero asymptotically as 
the assumed interaction space Vn grows. If //' is non-Gibbsian, then we expect the 
estimated renormalized interactions $^ to diverge in B^ norm, and it is perfectly likely 
that the $^ corresponding to different phases will diverge in different ways. 

We think that this explains the numerically observed discontinuities of the RG map, 
irrespective of whether the renormalized measure //' is Gibbsian or not. 

5.2 A Remark on Dangerous Irrelevant Variables 

The renormalization-group description of critical behavior in its simplest form seems 
to imply hyperscaling relations such as 

diy = 7' + 2/3 (5. II) 

diy = 2A4-7 (5.12) 

diy = 2- a (5.13) 

d + 2-T] , , 

8 = 5.14 

d-2 + f] ^ ' 

where ;/, a, /3, 7, 7', (5, A4, ?y are critical exponents and d is the spatial dimension. It is 
a well-known fact, however, that hyperscaling does not hold for systems above their 
upper critical dimension J„: for d > du the critical exponents are expected to be those 
of mean-field theory, and these exponents satisfy the hyperscaling relations only at 
d = du- Indeed, the hyperscaling relations (5.II)-(5.I4) have been proven rigorously 
to fail for Ising-like models in dimension d > 4 [2, 130, 7, 13, 6, 115]. 

The traditional explanation of hyperscaling — and of its failure — is the following 
[117, 118, 254]: Under an RG transformation H' = TZ^H) with linear scale factor /, the 
correlation length ^ and free energy density / transform as 

aH) = laH') (5.15a) 

f{H) = g{H) + l-'f{H') (5.15b) 
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where g is nonsingular. (In fact, for most RG maps these identities are only approxi- 
mate.) Near a fixed point H* we parametrize the Hamiltonian by scahng fields (/i, (/2, . . . 
with eigenvalues Z^^/^^,...; the variable gi is said to be relevant (resp. irrelevant) if 
yi > (resp. yi < 0). The critical surface corresponds to setting all the relevant scaling 
fields to zero. We can assume without loss of generality that gi is a relevant variable 
(j/i > 0). The asymptotic scaling laws then read 

^91,92,...) ~ lC{l'"9iJ'"92,...) (5.16a) 

^„,((/i,(/2,...) ~ l-'f,,nA.l''9iJ''92,...) (5.16b) 

Making the choice I = gi /^ we obtain 

«»■.»..»....) « k|-'»f(±l.^.^....) (5.17a) 

f....{9us..9s....) « l9.l*'/-.(±l.|^.|^.---) (5.17b) 

If now gi is an irrelevant variable (yi < 0), then gi/\gi\^''^^ --■ as gi —■ 0. It appears at 
first glance^ therefore, that for the purpose of determining the leading scaling behavior, 
the quantity gi/\gi\^''^^ on the right-hand sides of (5.17) can be replaced by zero. For 
example, if only the first two fields are relevant (the case of an ordinary critical point), 
we would obtain 

^91, 92, 9s, 94,...) ~ \g^\-'/^'cUl,j^^,0,0,..] (5.18a) 

fs^nA9l,92,9s,94,...) ~ \g^f/^'fs^nj±l,^^^,0,0,..] (5.18b) 

In particular, suppose that we set g\ = t (the temperature deviation from criticality) 
and g2 = h (the magnetic field). Then (5.18a) yields the scaling behavior of the 
correlation length: 

(-t)-''' ast^Q- I ^^^^ i^ = i^' = l/yt . (5.19) 

Likewise, (5.18b) and its derivatives yield the scaling behavior for the thermodynamic 
quantities: 

(a) Differentiating (5.18b) twice with respect to t and setting /i = 0, we obtain the 
critical exponents for the specific heat: a = a' = 2 — d/yt. Combining this with 
(5.19) yields the hyperscaling law dp = 2 — a. 



^^For a position-space RG map, this can of course be done only approximately, since / must be a 
power of the basic block size h. However, this is good enough for the purpose of obtaining critical 
exponents: by choosing / within a factor h of the desired value, one obtains the desired equality wtthm 
a hounded multtpUcattve constant. 
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(b) Differentiating (5.f8b) once or twice with respect to /i, then setting /i = 0, we 
obtain the criticai exponents for the spontaneous magnetization and the suscep- 
tibiiity: fi = [d — yu) lljt and 7 = 7' = (2j//i — d) jyt- Combining this with (5.f9) 
yieids the hyperscaiing iaw dv = ^' -\- 2/3. 

(c) Differentiating (5.f8b) four times with respect to h, then setting /i = (with 
t > 0), we obtain the criticai exponent for the four-point cumulant: 2A4 + 
7 = (4j//i — d)/yt. Combining this with (5.f9) and the formula for 7 yields the 
hyperscaiing law dp = 2A4 — 7. 

(Relations for exponents on the critical isotherm can be obtained in a similar manner 
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by setting gi = h and §2 = t = 0.) However, Fisher [117] pointed out that th 
reasoning is correct only (f /(</!, 5'2, 53, 54, • • •) o-nd its low-order derivatives have finite 
limits as (/3, (/4, . . . — )• when gi = ±1. If / or one of its low-order derivatives diverges as 
g^^g^, . . . — > 0, then the hyperscaiing relations can fail. A variable gi which is irrelevant 
in the RG sense but which provokes a divergence of the free energy density (or one 
of its low-order derivatives) is termed a dangerous irrelevant variable. We emphasize 
that the free energy is here being evaluated well away from the critical point, namely 
at (/i = ±1. 

The standard example of such a behavior is the (^"^ model in dimension J > 4. Here 
the fixed point is Gaussian, with relevant fields gi = t and g2 = h; the ip"^ coupling 
constant gs = u is irrelevant in the RG sense. However, the Gaussian model is unstable 
at nonzero magnetic field on the critical isotherm, and also at zero magnetic field below 
the critical temperature, and the irrelevant ip"^ term is needed to stabilize it. A mean- 
field calculation (which is expected to give the correct scaling for J > 4) predicts that 
the free energy diverges as u J, 0, as 

(5.20) 
(5.21) 

where H/^ is a well-behaved function. Inserting this behavior into (5.17b), one finds 
modified hyperscaiing laws which differ from (5.11)-(5.14) and which are consistent 
with the mean-field exponents. This behavior occurs because the fixed point H* is on 
the boundary of the stability region, and the free energy diverges as this boundary is 
approached. ^"^ 

Here we would like to make the trivial observation that such a blow-up of the 
free energy is possible only in models with unbounded Hamiltonians (such as the Lp"^ 
model). Indeed, we know that for absolutely summable interactions ($ G S^), the free 
energy density is a Lipschitz continuous function of the interaction (Propositions 2.56 



fit = 


-l,h,u) ^ u-^W{hu^l^ 


fit- 


= 0,/i,u) ^ u-^l^h^l^ 



^^See also Fisher [118, Appendix D] and Ma [254, Section VII. 4]. 

^^A qualitatively similar behavior is expected to occur also in dimension rf = 4. Here the dangerous 
irrelevant variable g^ = u is only marginally irrelevant (i.e. j/3 = 0, but second-order effects make g^ 
irrelevant), so that the violations of hyperscaiing are only logarithmic. 
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and 2.58). This means that the free energy density and its first derivatives are always 
bounded. This situation prevails in all physically sensible models of bounded spins. 

These considerations do not quite rule out the possibility of dangerous irrelevant 
variables: in principle it could happen that /(ibl, (/2,5'3, . . .) and its first derivatives 
are bounded, but that higher derivatives blow up. This would cause some or all of 
the hyperscaling relations to fail.^'* This is indeed what happens in the XY model in 
dimension J = 4 — e (and probably also J = 3) if we let gs be the coefficient of a cos nO 
single-site term, where n is even and > 4 [27f , fO]. Such a term is irrelevant in the 
RG sense (at least if e is small enough), but for T < Tc it suppresses the Goldstone 
modes. In J < 4 these modes give rise to a divergent longitudinal as well as transverse 
susceptibihty in the pure XY model [106, 243, 105], so that {d^f/dh'^){t = —l,h = 
O^gs) is finite for gs y^ but blows up as g^ —■ (presumably at the rate ~ (/g 
[271]). This means that the model with g3 ^ belongs to a Z„-symmetric but S0{2)- 
nonsymmetric universality class — which naively would not exist — and that in this 
universality class the relation 7' = {2yh — d) jyt [step (b) above] fails. As a consequence, 
the scaling law 7' = 7 fails, and is replaced by 7' = 7 — ^ey^/yt > 7 [271]. 

Other cases in which an apparently irrelevant term (in the RG sense) changes the 
phase diagram have been studied in [50, 10, 350]. 

However, we have not been able to construct any plausible Ansatz for such a be- 
havior in an Ising-to-Ising RG map for the Ising model in dimension J > 4. Nor do 
we know of any plausible candidate for the dangerous irrelevant variable. (In the Ising 
language there is no term in the Hamiltonian corresponding to the "(^"^ coupling" ; such 
a term is built into the a priori single-spin measure.) 

We conclude that the dangerous-irrelevant-variables scenario is probably not the 
correct description of what is happening in the Ising model in dimension J > 4, at 
least in the context of an Ising-to-Ising RG map. On the other hand, we know that 
the hyperscaling relations (5.II)-(5.I4) do fail for Ising models in dimension J > 4. 
Therefore, one of the other assumptions made in the conventional RG theory must fail 
when applied to Ising-to-Ising RG maps in dimension J > 4. 

For large-cell RG maps (6 — )• 00), the results summarized in Section 4.4 show 
that what fails is the Gibbsianness of the fixed-point measure //*. Now there is a 
very close similarity between the dangerous-irrelevant-variables scenario and the non- 
Gibbsianness proof for large-cell RG maps: both hinge on the fact that a massless 
Gaussian field is unstable to magnetic-field perturbations. This reasoning suggests 
that non- Gibbsianness of the fixed-point measure may occur also for iterated Ising- 
to-Ising transformations with fixed block size b (e.g. majority rule or the Kadanoff 
transformation). If this were the case, then the RG map TZ from Hamiltonians to 
Hamiltonians would be ill-defined at the critical Ising model, and the putative fixed- 



^■*Fisher [118, p. 134] states that the derivation of the hyperscaling relations relies implicitly on the 
assumption that the free energy /(±1, §2, §3, §4, ■ ■ ■) has a well-defined finite limit as gs, §4, . . . -^ 0. 
However, this statement is slightly misleading, because it is too weak: in fact, as is clear from (a)-(c) 
above, to derive a hyperscaling relation one needs to know that at least the second derivative of / 
with respect to t or /i has a good limit. 
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point Hamiltonian H* would simply not exist. 

6 Conclusions and Open Questions 

6.1 Conclusions 

6.1.1 How Much of the Standard Picture of the RG Map is True? 

We can classify the evidence regarding the validity or failure of the standard picture 
of RG transformations in three categories: 

1) Positive results. Some RG maps are well-defined in parts of the one-phase region. 
The published proofs refer to the following cases: 

(i) High-field results. Decimation and Kadanoff transformations for absolutely sum- 
mable lattice-gas [173] and Ising-spin [207] interactions. 

(ii) High-temperature results. Decimation [207, 212], Kadanoff [207] and averaging 
[212, 56] transformations for absolutely summable Ising-spin interactions. 

(iii) Small-field results. These results refer to decimation transformations of the Ising 
model in any dimension [258]: For any fixed temperature and nonzero value of 
the magnetic field, there exists a minimum block size bmin beyond which the 
renormalization transformation is well-defined (the minimum block size diverges 
as a power of 1/h when h — )• 0). 

(iv) Results in one dimen.sion. The decimation transformation is well-defined in di- 
mension d = 1 for lattice-gas interactions with many-body and long-range cou- 
plings satisfying the summability condition X]A9o(diamA -|- l)|A|~^||$^||oo < oo 
[59]. For instance, this includes all the two-body Ising interactions decreasing 
strictly faster than 1/r^. 

These results are, however, of limited interest, as they correspond to well-understood 
regions of the phase diagram, deep within the regime in which uniqueness of the Gibbs 
measure, and even analyticity of the free energy and correlation functions, can be 
proven. If these were the only positive results, then one would conclude, in agreement 
with Griffiths and Pearce's pessimist [172], that the method only works where one does 
not really need it (and, we may add, sometimes not even there, given Theorem 4.8). 

We can also mention, as positive results (of a sort), our Fundamental Theorems of 
Section 3 which say that the RG map is single- valued and continuous — in accordance 
with the standard picture — if it exists at all. 

2) Non-negative results. There is at present no evidence of RG pathologies above 
or at the critical temperature for models strictly below the upper critical dimension J„ 
(= 4 for Ising-like models). In fact, there exist models for which the critical point has 
been rigorously studied using the standard RG prescription: the hierarchical models 
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[218, 32, 219] and the Gross-Neveu model [149]. The hierarchical models present the 
most laithlul transcription ol Wilson's prescription, but from our point of view they 
are somewhat artificial as the possible pathologies are removed "by hand" . The Gross- 
Neveu model is fermionic, and thus has no direct probabilistic interpretation. We also 
should mention here some very interesting preliminary results [214] indicating that for 
the two-dimensional Ising model at zero field, the majority-rule transformation might 
be well-defined at (as well as slightly below) the critical temperature. These results are 
partially rigorous and partially numerical, and so far they concern only some selected 
(albeit judiciously selected) block-spin configurations. We feel that this work provides 
some support for the standard picture, but its results are still inconclusive. 

3) Negative results. There are pathologies at low temperature (not only at zero 
magnetic field) in all dimensions, and quite possibly at the critical point in dimen- 
sion d (^) du- In the former case these pathologies consist in the non-Gibbsianness 
of the renormalized measure, that is, in the impossibility of constructing a renormal- 
ized Hamiltonian after even a single RG transformation. In Sections 4.1-4.3 we have 
shown examples of such pathologies for all the standard real-space transformations 
(decimation, Kadanoff, majority-rule, averaging). The range of temperatures where 
these pathologies are proven to exist does not include the critical temperature, but on 
the other hand the pathological region extends off the phase-coexistence curve, i.e. to 
nonzero (and in some cases large) magnetic field (Section 4.3.6). Finally, in Sections 
4.4 and 5.2 we have given arguments indicating that for J(^)4 there may be pathologies 
at the critical point. In these latter cases our arguments suggest that the fixed-point 
Hamiltonian may be ill-defined. 

Taken together, these results suggest that non-Gibbsianness may be the normal 
situation for RG maps at low temperature and/or near a first-order phase-transition 
surface, or at the critical point in high dimensions. This is in direct confiict with the 
conventional RG ideology (compare the first paragraph of the Introduction). 

6.1.2 Responses to Some Objections 

Many of our colleagues, upon hearing our results, have initially reacted by saying: "If 
fi' is not Gibbsian for some interaction in B^^ then that just means it is Gibbsian for 
some interaction not in B^. You have to use a larger space of interactions." This 
view seems a priori reasonable — and it is even conceivable that it is correct — but 
unfortunately things are not quite so simple. Before asserting that //' is Gibbsian for 
some interaction $' ^ B^^ one first has to define what it means for a measure to be 
"Gibbsian" for a non-absolutely-summable interaction. Our notion of Gibbs measure 
relies on the DLR equations, and if the interaction fails to be absolutely summable 
(or at least convergent), then these equations simply do not make sense. It is thus 
incumbent on the advocate of "larger interaction spaces" to make precise what is the 
correspondence between measures and interactions that is to substitute for the DLR 
equations (and be equivalent to them when the interaction is absolutely summable). 
Now, as is usual when one is looking for the solution of an equation, there are two 
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complementary aspects — existence and uniqueness — and one wants preferably for 
both properties to hold. The existence is favored by enlarging the space of possible solu- 
tions, while the uniqueness is favored by narrowing it; and it is far from clear, a priori, 
whether there exists a space in which the solutions both exist and are unique. These 
general remarks can be exemplified in our statistical-mechanical problem. Within the 
class of Feller specifications (and hence a fortiori within B^), the Griffiths-Ruelle the- 
orem (Theorem 2.15, Corollary 2.18 and Proposition 2.59) guarantees the uniqueness 
of the specification for a given measure // (and hence the uniqueness modulo physical 
equivalence of the interaction). But the existence may fail, as we showed through nu- 
merous examples in Section 4. On the other hand, if we enlarge the space of allowed 
specifications by dropping the Feller (sa quasilocality) property, then the existence 
holds but the uniqueness fails spectacularly (see the Remark at the end of Section 
2.3.4). Similarly, if we enlarge the space of allowed interactions from B^ to S°, and 
generalize "Gibbs measure" to "equilibrium measure", then every ergodic measure of 
finite entropy density is the equilibrium measure for some interaction, but the unique- 
ness again fails spectacularly (see item 2 in Section 2.6.7). One certainly cannot develop 
a satisfactory RG theory in such pathological spaces. 

Furthermore, we have given strong arguments that any physically reasonable speci- 
fication must be quasilocal , at least in systems of bounded spins (see Section 2.3.3). On 
the other hand, in Sections 4.1-4.3 and 4.5.2 we have proven directly that the renor- 
malized measures are not consistent with any quasilocal specification. So even if there 
were to exist quasilocal specifications corresponding to interactions not in B^ , such 
specifications could not be of any relevance for our renormalization-group problem. ^^ 

Our final objection to considering spaces of interactions larger than B^ is that B^ 
is already too big! Indeed, the standard RG ideology [365] is that the RG fiow should 
take place in some space of "short-range" interactions, e.g. interactions which decay 
exponentially or at least like a sufficiently large power (e.g. \x\~'^ with p > d-\-l). This 
ideology is not a mere whim, but results from the need to explain universality of critical 
behavior: one needs to have an interaction space in which the unstable manifold of 
a given fixed point is finite- dimensional (i.e. there are finitely many relevant scaling 
fields). Now, such a behavior is impossible in a space of long-range interactions (such 
as B^), since in general the critical exponents will be altered by any perturbation that 
decays like |a;|~(''"'"^~^) with e > the critical exponent rj of the original model [292, 
Section 10.2]. Moreover, even the qualitative phase diagram is unstable to long-range 
but summable pair interactions [349, 332, 208]: that is, the Gibbs phase rule cannot 
hold in B^ or even in any B'^ . In order to have any hope of constructing a satisfactory 
RG theory, it is necessary to work in a space of "short-range" interactions, such as the 
space Bh for some h^ 1. 

A second comment which is often made is the following: "The RG map is always 



^^Note also that Sullivan [336] and Kozlov [222] have almost proven that every quasilocal speci- 
fication arises from an interaction in B^: see Theorem 2.12 and the Remarks following it, plus the 
Remark at the end of Section 2.4.9. 
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well-defined as a map from measures to measures; the pathologies come from trying 
to lift it to a map from Hamiltonians to Hamiltonians. So why not just stick with the 
RG map (1.1) on the space of measures?" 

This is a sensible question, which was already raised by Griffiths and Pearce [173, 
p. 534-535], and our answer is essentially the same as theirs: Many interesting things 
can^ indeed, be learned by studying the action of RG transformations on measures. 
For linear RG transformations, this is an ancient branch of probability theory that 
goes back to Gauss' and DeMoivre's investigations of the central limit theorem for in- 
dependent random variables, and which continues to this day in studies of central and 
non-central limit theorems for dependent random fields [201, 180, 273, 274, 58]; it is 
closely related to studies of triviality and non-triviality for scaling limits in statistical 
mechanics and quantum field theory [323, 71, 115]. For nonlinear RG transformations, 
this study is only beginning [281, 193], but we expect it to be fruitful as well. Unfor- 
tunately, not all of the RG theory can be carried out on the space of measures alone. 
For example, the critical exponent 7 measures the rate of divergence of the susceptibil- 
ity as the temperature approaches the critical temperature. Now, the susceptibility is 
the integral of the 2-point correlation function, and thus can be read off the measure; 
while the (inverse) temperature is the coefficient of some term in the Hamiltonian (e.g. 
the nearest-neighbor term in the case of the Ising model). Therefore, the exponent 7 
can be deduced only from a theory that relates the measure to the Hamiltonian (or 
interaction); it cannot be deduced solely from an RG map acting on the space of mea- 
sures. The same goes for the exponent ;/, which measures the rate of divergence of the 
correlation length as the temperature approaches criticality. On the other hand, the 
exponent ratio ^ j v measures the relative rate of divergence of two different aspects of 
the 2-point correlation function, and so can potentially be deduced from a measures- 
to-measures RG map. Likewise, the critical exponent rj measures the rate of decay 
of the 2-point function at the critical point, making no reference whatsoever to the 
temperature; therefore, it too can potentially be deduced from a measures-to-measures 
RG map. It follows that the scaling law ^ j v = 2 — rj also lies potentially within the 
purview of a measures-to-measures RG theory. 

6.1.3 Where Does All This Leave RG Theory? 

After the more-or-less cold exposition of facts of Section 6.1.1, and the additional clar- 
ification (pre-emptive defense) of the previous subsection, let us present some general 
remarks about the consequences of the present work for the RG enterprise. 

We think that there is already a substantial body of evidence indicating that the 
conventional RG theory, in its narrow sense of Hamiltonian-to-Hamiltonian maps, 
needs to be reexamined. However, this does not, in itself, detract in any way from 
the value and significance of the RG ideas that have pervaded much of today's statis- 
tical mechanics and quantum field theory. The RG philosophy — interpreted broadly 
to include various kinds of "multi-length-scale" and "coarse-graining" arguments — 
has been, and will continue to be, our main tool to analyze the otherwise inaccessible 
"intermediate temperature" regions, which fall beyond the reach of series expansions 
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or perturbative arguments and yet are the regions in which the most interesting phe- 
nomena take place. 

The main issue in the proper apphcation of RG theory to a particular problem is 
the choice of variables in which to express the model, along with the choice of the RG 
map. This was already understood by the founding fathers of the field. It corresponds 
to what Michael Fisher calls "aptness or focusability" of the transformation, and his 
own words are especially clear: 

For any given Hamiltonian or class of Hamiltonians there is not just one 
renormalization group — ^Hhe renormalization group" as some people say — 
but rather there are many that might be introduced, and one must question, 
for example, whether the process is best carried out in real space or mo- 
mentum space and so on. A "good" renormalization group must be "apt" 
or appropriate for the problem at hand, and it must, in particular, "focus" 
properly on the critical phenomena of interest. [118, page 82] 

Let us mention an illustrative example: The usual transformations involving averaging 
(or other kinds of "voting") over square blocks are designed mostly having ferromag- 
netic systems in mind. They are efficient for selecting the zero-momentum modes, 
which are indeed the modes that become critical in an ordinary ferromagnetic transi- 
tion. On the other hand, these transformations are unsuitable for studying antiferro- 
magnets because they do not distinguish the oppositely magnetized sublattices. This 
was remarked by van Leeuwen [357], who showed how a more careful design of the 
block shapes could overcome this deficiency (his proposal is depicted in Figure 2(d)). 
Thus, while most people imagine RG maps as acting in a huge space of Hamiltonians 
— including regions exhibiting various different types of phase transitions (ferromag- 
netic, antiferromagnetic and many others) — it is unlikely that any single RG map can 
exhibit well-behaved fixed points corresponding to all of these transitions. Rather, one 
must "custom-make" the RG map for each new physical situation. 

In this regard, our work — building on that of Griffiths, Pearce and Israel [172, 173, 
171, 207] — can be considered an extension of the preceding observations: "aptness" 
and "focusability" are needed not just to ensure the usefulness of the map, but even its 
very existence. On the other hand, the success stories of rigorous RG studies teach us 
that the search for this "aptness" may require a very open-minded attitude, in the sense 
that, in many cases, the appropriate variables are not necessarily spin variables and, in 
fact, not even local objects. Indeed, with the exception of hierarchical [218, 32, 219] and 
fermionic [149] models, rigorous RG studies have not implemented the strict Wilson 
prescription involving an RG transformation of Hamiltonians written in terms of spin 
variables. Rather, they have employed a combination of spin variables and polymer 
ensembles [147, 148, 150, 151, 181, 188] or a pure polymer ensemble [53] when studying 
critical phenomena, or an ensemble of Peierls-like contours [145, 146, 44] when studying 
first-order phase transitions. More generally, "multi-scale" and "coarse-graining" ideas 
have been used in a wide variety of problems, including: 

• ultraviolet stability of the (p^ [159, 28] and Yang-Mills4 [19] quantum field theories; 
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• the Kosterlitz-Thouless transition in the two-dimensional XY and related models 
[134, 135]; 

• the ferromagnetic transition in the one-dimensional 1/r^ Ising model [137]; 

• confinement in the three-dimensional U{1) lattice gauge theory [166]; 

• localization for random Schrodinger operators [138]; 

• the phase transition in plaquette percolation [4]; 

• the intersection properties of ordinary random walks and of Brownian motion 
[3, 112, 115]; and 

• the critical behavior of self-avoiding walks [52, 185, 187, 186], percolation [183, 
182] and branched polymers [184] in high dimensions. 

In many of these examples, the "coarse-graining" is applied at the level of objects with 
some geometric content, such as random walks, clusters, surfaces, contours, etc. 

Thus, our work is in no way an attack on the essential physical ideas behind the RG 
approach. It simply points out the need for a more general definition of their scope. 

6.1.4 Towards a Non-Gibbsian Point of View 

Let us close with some general remarks on the significance of (non-)Gibbsianness and 
(non-)quasilocality in statistical physics. Our first observation is that Gibbsianness 
has heretofore been ubiquitous in equilibrium statistical mechanics because it has been 
put in by hand: nearly all the measures that physicists encounter are Gibbsian because 
physicists have decided to study Gibbs measures! However, we now know that natural 
operations on Gibbs measures can sometimes lead out of this class: among such op- 
erations are some renormalization transformations (Sections 4.1-4.3 and 4.5.2), some 
nonlinear local functions (Section 4.4), convex combinations (Section 4.5.1), and weak 
limits (Section 4.5.6). It is thus of great interest to study which types of operations 
preserve, or fail to preserve, the Gibbsianness (or quasilocality) of a measure. This 
study is currently in its infancy. 

More generally, in areas of physics where Gibbsianness is not put in by hand, 
one should expect non-Gibbsianness to be ubiquitous. This is probably the case in 
nonequilibrium statistical mechanics (Section 4.5.4). 

Since one cannot expect all measures of interest to be Gibbsian, the question then 
arises whether there are weaker conditions that capture some or most of the "good" 
physical properties characteristic of Gibbs measures. For example, the stationary mea- 
sure of the voter model appears to have the critical exponents predicted (under the 
hypothesis of Gibbsianness) by the Monte Carlo renormalization group [362], even 
though this measure is provably non-Gibbsian [246]. 

One may also inquire whether there is a classification of non-Gibbsian measures 
according to their "degree of non-Gibbsianness". Joel Lebowitz has suggested to us 
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the analogy with the rational and real numbers: although the set of rationals is very 
"small" in many senses (e.g. first Baire category, zero Lebesgue measure), it is "large" 
in the weak sense that any real number can be approximated by a sequence of rational 
numbers; and the irrational numbers can be classified according to the rate at which 
they can be approximated by rationals (Diophantine approximation). In Section 4.5.6 
we conjectured a similar scenario for the Gibbsian measures within the space of all 
measures. It would then be natural to classify the non- Gibbsian measures according 
to how well (or how rapidly) they can be approximated by Gibbsian ones. 

Finally, there is a philosophical question, raised by one of our colleagues in Rome 
(to whom we apologize because we cannot remember his name): All mathematical 
modelling, in any branch of science, involves selecting the "important" variables in 
the description of a system and neglecting the variables judged "unimportant". In a 
statistical system this means that the "unimportant" variables are integrated out, i.e. 
one performs a kind of "decimation" transformation. Now, if the decimated variables 
are only weakly coupled to the others, then one may hope that the decimation will 
lead to a Gibbs measure (although rigorous theorems guaranteeing this seem to be 
lacking). However, one could also fear that the result of the decimation might be a 
non-Gibbsian measure, especially if the decimated variables are strongly coupled to 
the others. (Such variables might still be deemed "unimportant" if they were believed 
to affect only uninteresting quantitative details of the problem, without changing the 
features of interest.) In this case, not only would one be making an approximation in 
describing the system by a particular "model Hamiltonian" , but even the description 
of the decimated system by any Hamiltonian would itself be an approximation. And 
one would have to investigate how good this approximation is. 

6.2 Some Open Questions 

We end with a list of open questions for future research: 

1) Clean up the circle of results connected with the Gibbs Representation Theorem 
(Theorem 2.12), particularly in the translation-invariant case (Sections 2.3.3, 2.4.9 and 

A. 2). 

2) Determine whether {$: II^Hb^ ^ M} + J7 is a closed subset of S°, if /i ^ 1, and 
in particular for Bh = B^ (Sections 2.4.4 and 2.4.6). This affects the ways in which the 
RT map can blow up at the boundary of its domain (Section 3.3), and arises also in 
our theory of parameter estimation (Section 5.1.2). 

3) Devise a clean general theory for systems of unbounded spins, analogous to the 
spaces S° and B^ for systems of bounded spins (Sections 2.4.4 and 3.1.4). 

4) Investigate rigorously the Gibbsianness or non-Gibbsianness of the renormalized 
measure in the following models: 

(a) Ferromagnetic Ising model, using the decimation transformation with 
spacing b: does the cutoff temperature for non-Gibbsianness tend to 
Jc as 6 —)• oo? (See Section 4.3.2.) 
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(b) Ferromagnetic Ising model, using the majority-rule transformation 
with block sizes b not covered by the construction in Section 4.3.4. 
(For dimension J > 3, it appears that no block sizes b are covered by 
this construction: see Appendix C.) 

(c) Ferromagnetic Ising model at low temperature and nonzero magnetic 
field, in dimension J = 2, using the decimation, KadanofF or majority- 
rule transformation. 

(d) Antiferromagnetic nearest-neighbor Ising model in a uniform magnetic 
field, on the paramagnetic-antiferromagnetic critical surface: compare 
the majority-rule (or Kadanoff) transformation on square [bx b) blocks 
to the same transformation on van Leeuwen's 5-spin blocks [357, 55]. 

(e) q-state Potts model with q large, at (or near) the first-order phase 
transition, using either the ordinary "plurality-rule" (or Kadanoff) 
transformation [305] or the modified transformation including vacan- 
cies [279, 305]. 

(f) Other models at or near a first-order phase transition. 

(g) Ferromagnetic Ising model at the critical point in dimension J > 4, 
using a majority-rule (or Kadanoff) transformation with fixed block 
size b (Section 4.4). 

5) Improve/generalize the theorems on non-Gibbsianness of local nonlinear func- 
tions of an anharmonic crystal, and in particular try to prove non-quasilocality (Section 

4.4). 

6) Try to generalize Schonmann's example (Section 4.5.2) to dimensions J, d' other 
than J = 2, J' = I. 

7) Prove (or disprove) the existence of measures consistent with the Fortuin-Kasteleyn 
random-cluster-model specification (4.91); in particular, prove (or disprove) that the 
infinite-volume limit measures taken with free or wired boundary conditions are con- 
sistent with this specification (Section 4.5.3). 

8) Investigate the Gibbsianness or non-Gibbsianness of the stationary measure(s) 
in various stochastic evolutions not satisfying detailed balance (Section 4.5.4). 

9) Investigate the abstract properties of the set Q of Gibbsian measures (Sections 
4.5.6 and 6.1.4). 

10) Investigate rigorously the model of parameter estimation introduced in Section 
5.1.2; in particular, try to prove Conjecture 5.1 or some weakened version of it. 

11) Make a high-precision MCRG test, using a large space of renormalized interac- 
tions, to compare a case in which the renormalized measure is expected to be Gibbsian 
(e.g. the d = 2 Ising model at a temperature not too far below critical) with a case 
in which the renormalized measure is expected or proven to be non-Gibbsian (e.g. the 
d = 2 Ising model at low temperature) [Section 5.1.3]. 
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12) Clarify the relationship between RG transformations acting on contours [145, 
146] or polymers [147, 148, 150, 151, 181, 188, 53], and the traditional RG transforma- 
tions acting on spins. 

13) Discuss the Gibbsianness or non-Gibbsianness of various states of quantum 
lattice systems [253, 355]. Here one problem is to understand better the relationships 
between the various alternative notions of "Gibbsianness" in the quantum case. 

14) Prove Conjecture C.5. 

A Proofs of Some Theorems from Section 2 
A.l Proofs and References for Section 2.1 

The remarks made in Section 2.1.2 are all well-known results. Here are some references: 

(a) Oo compact =^ ft compact =^ every continuous function on is bounded 
[309, Proposition 9.4]. The density of Ciodft) in C(0) is an easy consequence of the 
Stone- Weierstrass theorem [309, Theorem 9.28]. 

(b) If Oo is discrete, then every local function is continuous; and continuity is 
preserved under uniform convergence. 

(c) This is an immediate consequence of (a) and (b). 

Further Remark. If the single-spin space Oq is noncompact, there may exist 
bounded continuous functions which are not quasilocal. Hans-Otto Georgii provided 
us with the following example: take C = fto = Z and let /(cr) = cr^j^ (!); then let g be 
a bounded function of /, say g = |/|/(1 + |/|). 

In fact, this construction can be imitated whenever Oq is a noncompact metric 
space and C is infinite: Let /i, /2, . . . G C(0o) have disjoint supports Si^ 5*2, . . . with 
Si n [j Sj = and ll/illoo = 1 (such functions are easily constructed using Urysohn's 

lemma); let Xq, Xi, 2:2, . . . be distinct sites in C; let g G C(0o) be non-constant; and 
define h{(7) = J2n=i 9{'^xn)fn{crxo)- 

Standard references for the theory of probability measures on metric spaces are 
the books of Parthasarathy [289] and Billingsley [29]. Probability measures on general 
(not necessarily metrizable) topological spaces are treated in [358, 73, 320]. The Riesz- 
Markov theorem is [309, Theorem 14.8] or [289, Theorems II. 5. 7 and II. 5. 8]. The 
theorem on support of a measure is [289, Theorem II. 2.1]. 

The bounded measurable topology on Af(0) and Af_|_i(0) is discussed in [143]. The 
weak topology on Af_|_i(0) is discussed in detail in [289, 29, 358]; in particular, the 
topological properties of Af_|_i(0) for different classes of spaces are discussed in [358, 
Part II], [289, Section II.6] and [73, Theorem III-60]. 

If Oo is a separable metric space, then every uniformly continuous function on is 
quasilocal [157, Remark 2.21(2)]. Since the bounded uniformly continuous functions are 
sufficient to generate the (ordinary) weak topology (this is the famous "portmanteau 
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theorem" [29, Theorem 2.1]), it follows that the bounded quasilocal topology is stronger 
than the weak topology. On the other hand, if Oq is also discrete (hence countable), 
then every quasilocal function is continuous, so the two topologies in fact coincide. See 
[155, Remark 0.3]. 

A. 2 Proofs and References for Section 2.3 

Proposition 2.7 is essentially [157, Remark 1.24]. Examples 1 and 2 in Section 2.3.3 
are [157, Proposition 2.24 and Example 2.25]. Theorem 2.10 and related results are 
discussed in [157, Section 2.2]. Theorem 2.12 is proven by Kozlov [222]; see also Sullivan 

[336]. 

Remarks. 1. The following conjectured extensions of Theorem 2.12 appear to be 
open questions: 

(a) If n is quasilocal and nonnuU (but not uniformly nonnuU), and Oq is not finite, 
does there exist a uniformly convergent interaction $ such that 11 = H*? [This theorem 
might be relevant to models of unbounded spins with finite-range interactions.] 

(b) If n is quasilocal, uniformly nonnuU and strongly Feller in the sense that / G 
BiJl^J^f^) implies tta/ G Cg;(0), and Oq is not finite, does there exist a continuous 
absolutely summable interaction $ such that 11 = 11*? 

2. Regarding the relation between quasilocality and the Eeller property, the fol- 
lowing appears to be an open question: If Oq is compact but not finite, can a Eeller 
specification fail to be quasilocal? 

Proof of Theorem 2.15. Let // be consistent with Eeller specifications IIi and 
112. Then 

E,{f\rt..){u) = (7riA/)(cu) = (7r2A/)(cu) /i-a.e. (A.l) 

for each / G C(0). Now, since // gives nonzero measure to every open set, two con- 
tinuous functions which agree /i-a.e. must in fact agree everywhere. So we must have 
(7I"ia/)(^) = {T^2kf){'^) for all u. But if the two measures 7riA(cu, • ) and 7r2A(^, • ) give 
equal expectations to each continuous function /, then they must be equal. ■ 

Eurther examples of pathological non-quasilocal specifications, along the lines of 
the Remark at the end of Section 2.3.4, are given by Georgii [157, pp. 34-35]. 

Theorem 2.17 and Corollary 2.18 are proven in [157, Theorem 2.34]. Propositions 
2.19 and 2.20 are [157, Proposition 7.9 and Theorem 7.7]. Proposition 2.22 is almost 
immediate from the definition of Eeller specification and weak convergence; for related 
results, see [157, Sections 4.3 and 4.4]. Proposition 2.23 is proven in [157, Theorem 
7.12]. 

Proof of Proposition 2.25. Recall that /i'^( • ) is a regular conditional probabil- 
ity for /i given JF^, i.e. it depends on u only through cua; and we are interested only in 
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its restriction to jF^c, i.e. we want to study the measure fi^^[d(jj'^c). The claim is now 
that for fi-a.e. cua, we have 



/i--(Jcule)7r--Ke,A) = iJ--{A) (A.2) 

for all A G J-'a^ and all A C A^. Both sides of this equation are jFA-measurable. So it 
suffices to prove that for all / G BiJl^J^^) we have 

^M(tuA)/(OA)y'/i"^(icu;,eX^Ke,A) = JdfiA{uA)f{nA)fi'^^{A). (A.3) 

Now the right-hand side of (A.3) is J fxA dfi^ by definition of regular conditional prob- 
ability. As for the left-hand side, let us rewrite it as 

MM(cuA)/i"M^^kO]/(f^A)7rrKc,A) ; (A.4) 

this passage from an iterated integral to a single integral on the product space is 
justified by [272, Proposition III-2-f]. But the measure in brackets in A.4 is precisely 
J/i(c<;A,cu^c); so the left-hand side of (A.3) equals 

dfi{uJA,Oj'Ac) f{^A)'^A{l^A XUj'^c,A) , (A. 5) 

where we have now inserted the definition (2.37) of vr^^. We now use the fact that fi is 
consistent with 11, and that A C A^ (so A C A^); it follows that (A. 5) equals JfxA dfi. 



A.3 Proofs and References for Section 2.4 
A.3.1 Van Hove Convergence 

Proof of Proposition 2.27. It is easy to see that 

x G A =^ dist(a;,A'') = dist(a;, 9^+A) (A. 6a) 

X e A" =^ dist(a;,A) = dist(a;, 9]"A) (A. 6b) 



Therefore 



\d;A\ < (2r + l)'^|a+A| (A.7a) 

\d+A\ < i2r + lY\d^A\ (A.7b) 



It follows that (a)-(c) are equivalent. 
Next notice that 



xeA\{A + a) =^ x G A and dist(a;. A'') < |a| (A. 8a) 

xe{A + a)\A =^ x G A'' and dist(a;. A) < |a| (A. 8b) 
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Therefore (c) implies (d) and (e). Conversely, 



d-A = U [A \ (A + a)] (A.9a) 

|a| = l 

d+A = \J[iA + a)\ A] (A.9b) 

|a| = l 

SO (d) =^ (a) and (e) =^ (b). 
Finally, 

AA(A + A) C \J[AA{A+{a})], (A.IO) 

aeA 

SO (d) and (e) together imply (f). On the other hand, taking A = {a} shows trivially 
that (f) implies (d) and (e). 

This completes the proof of equivalence of (a)-(f). 

Next we prove that lim^^oo |A„| = oo: this follows immediately from (a) and the 
fact that |9]~A| > 1 whenever A and A^ are both nonempty. 

Finally, let us prove statement (/3): For each n, choose a„ G Z and r„ G Z_|_ so 
that Br„{an) = {x G Z : |a; — a„| < r„} is a maximum-sized ball contained in A„. 
We claim that lim^^oo ?"ra = oo- Proof: Fix any r > 0. Since lim^^oo |A„| = oo 
and lim^^oo |^r^>i|/l^>i| — O5 ^^ clearly have lim^^oo |A„ \ 9~A„| = 00 and hence in 
particular A„\9~A„ ^ for all sufficiently large n. But A„\9~A„ ^ is just another 
way of saying that rn> r. ■ 

Remarks, f . Many books [3f2, 206, 40] use a more complicated definition of van 
Hove convergence, based on a paving of Z by cubes of side a. It is easy to see that 
this definition is equivalent to conditions (a)-(f). 

2. What physicists call van Hove convergence is termed F0lner convergence by 
mathematicians. Much of the theory extends, in fact, to locally compact amenable 
(semi)groups [167]. See [223, Section 6.4] for ergodic theorems in this context. 

A. 3. 2 Translation-Invariant Measures 

Proposition 2.30 is [157, Theorem 14.5 and Proposition 14.7]. Proposition 2.31 is [157, 
Corollary 14. A5 and Theorem 14. A8]. For more information on ergodic theorems, 
along with some relevant counterexamples, see [223, pp. 222-226]. Proposition 2.32 is 
proven in [157, Theorem 14.12] or [206, Lemma IV. 3. 2]; a stronger form will be proven 
as Proposition 2.61(e) below. Information on the Poulsen simplex can be found in 
[252, 284]. 

A. 3. 3 A Digression on Subadditivity 

An important role in the theory of translation-invariant lattice systems is played by 
the concept of a subadditive set function. Subadditivity arguments will be used to 
prove the existence of the infinite-volume limit for the pressure, the entropy density. 
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and quantities connected with the quotient norm. We therefore collect here the needed 
results. 

Definition A.l Let S be the class of all nonempty finite subsets of Z , and let S* = 
S U {0}. A function F: S* — > [—00, 00) is called 

• subadditive (/^(AiUAa) < F{Ai) + F{A2) whenever Ai, A2 G S* with AinA2 = 


n 

• completely subadditive if F[A) < Y, KF{Ai) whenever A, Ai, . . . , A„ G S* with 



8 = 1 



Xa= J2 ^tXA, and all X^ > 



8 = 1 



• strongly subadditive if F{Ai U A2) + F{Ai D A2) < F{Ai) + ^(^2) whenever 
Ai,A2G5* 

Clearly, complete subadditivity implies subadditivity. The key nontrivial fact is: 

Lemma A. 2 ([268, Tlieoreme 2]) If F is strongly subadditive and F{0) > 0, then 
F is completely subadditive. 

Remark. If F is subadditive, then either F[0) > or else F = —00. In our 
applications we will always have F[0) = 0. 

We can now state the two principal theorems on the existence of the infinite- volume 
limit: 

Proposition A. 3 Let F: S* — > [—00,00) be translation-invariant and completely sub- 
additive. Then lim |A|~^F(A) exists and equals inf |A|~^F(A). 

Proposition A. 4 Let F: S* — > [—00,00) be translation-invariant and subadditive. 
Then lim |A„|~^F(A„) exists for any van Hove sequence (A„) satisfying the addi- 

tional condition |A„|/diam(A„) > (5 > for some 8 > {). Moreover, this limit equals 
inf |C„|-iF(C„). 

ri>l 

We note that ordinary subadditivity is not sufficient for the existence of the van 
Hove limit; a counterexample has been given in [175]. 

Proof of Proposition A. 3. This result is stated in [267, Theoreme 0] and proven 
in [268, CoroUaire 10], but the proof is rather difficult to follow. For completeness let 
us give an elementary proof [333]: 

Let A, i? G iS; without loss of generality let us suppose that G -B. Now consider 
the decomposition 

^^ = H Tr|^^+" "^ ^ ^xX{x} (A. II) 

a:B+aCA I I xeA 
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where 

\{a: B -\- a 3 X and B -\- a (^ A]\ 

^T -— mn • 

\B\ 

Clearly < A^; < 1; and by summing (A. 11) over y (zl- we find 



(A.12) 



^A. = 


= A\- ^{A-h) 


x^A 


beB 




= A\f]iA-b) 




beB 


= 


= rn-^iA). 



since 



oeB] 



(A.13) 



By complete subadditivity and translation-invariance it follows from (A. 11) and (A.13) 
that 



a:B + aCA 

F(B 



B 



xeA 



\B\ 
F(B 



beB \xeA I 

[|A|-m5(A)] + m5(A)F({0}) . 



(A.14) 



Now divide by |A| and take A Z' oo (van Hove): by Proposition 2.27(d) we have 
lim ?7i^(A)/|A| = 0. Therefore 

A/'oo 



F(A) F(B) 
lim sup , . , < , „, 

A/'co \A\ \n\ 



(A.15) 



Since this holds for all i? G iS, we have 



F(A) . ^ F(B) ^. . F(B) 

lim sup , , , < ml — — — < lim ml — — — 

A/oo \A\ Bes \B\ B/oo \B\ 



(A.16) 



Proof of Proposition A. 4. This is essentially [198, Proposition 4.10]. See also 
[175, 333]. ■ 

Remarks. 1. The important concept of complete subadditivity was apparently 
first introduced by Moulin- OUagnier and Pinchon [267, 268]. 

2. The proofs given here actually work (after slight notational changes) in an 
arbitrary discrete amenable group [333]. A slightly different proof of Proposition A. 3, 
also valid for discrete amenable groups, is implicit in [266, proof of Theoreme 2]. For 
an extension to locally compact amenable groups, see [268]. 
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A. 3. 4 A Lemma on Sums of Translates 

Next we use subadditivity arguments to prove an important lemma concerning the 
infinite-volume limit of sums of translates of a function /. This lemma will play an 
important role in our study of the quotient seminorms. 

First let us introduce a convenient notation: for any g G Bijt), let us define the 
maximum, m.inim.um. and m.idpoint values of g by 



sup (/ 
inf g 

midg 



sup g[uj) 
inf qiio) 



sup5'(cu) + inf 5'(cu) 



^(sup (/ + inf (/) 



Clearly we have 

Halloo = max(sup (/, — inf (/) 
||5'||B(fi)/const = Ksups- -inffl-) = lis- -mids' 
Lemma A. 5 Let f (^ B{fl). Then: 

lA 



fa) lim 

* ^ A/oo 

fb) lim 

* ^ A/oo 

fc) lim 

* ^ A/oo 

fd) lim 

* ^ A/oo 



/^e) lim 

* ^ A/oo 



"^ sup X] Taf exists and equals inf |A| ^ sup Y, Taf. 
"^ inf X] 2^a/ exists and equals sup |A|~^inf Y, Taf. 

aeA AeS aeA 



Y Taf exists and equals inf |A| ^ Y Taf 

aeA °° Ae5 agA 



aeA 



exists and equals inf |A| ^ Xl Taf 

B(Q)/const AeS aeA 



(A. 17a) 
(A.17b) 

(A.17c) 

(A. 18a) 
(A. 18b) 



B(n)/const 



"^ mid J2 Taf exists and lies in the interval [inf /, sup/]. 
\aeA I 



Proof, (a) Consider the set function F+(A) = sup YaeA Taf ^ defined for finite sub- 
sets A C Z . Clearly F is finite- valued and translation-invariant. Moreover, it is 

n 

completely subadditive (Definition A.l): if A, Ai, . . . , An G S with xa = Y ^iXA, and 

8 = 1 

all Aj- > 0, then 

F{A) = sup J2{Taf){u) 

^^^ aeA 

n 

= sup^A, Y,{Taf){'^) 
^efi 8=1 aeA, 
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< ^A, sup^(r„/)H 

i=l ^^^ aeA^ 

n 

= ^A,F+(AO. (A.19) 

8 = 1 

Proposition A. 3 then implies that hm |A|~^F+(A) exists and equals inf |A|~^F+(A). 

A/'oo A£5 

(b) is simply (a) applied to the function — /. 

(c) Consider the set function F[A) = || ZlaeA ^^a/lloo! the proof is then as in (a). 

(d) This is an immediate consequence of (a) and (b) together with (A. 18b). [Or it 
can be proven directly by applying complete subadditivity to Fc{A) = || Y^aeA Taf \\B(n)/ const- 

(e) This is an immediate consequence of (a) and (b). ■ 

Remark. For / G Bqi[ft) we can prove this lemma by a slightly different argument 
based on the fact that the set functions F+, F and Fc are "almost additive" (and not 
merely subadditive). Since the argument is virtually identical to that used by Israel 
in proving the existence of the pressure [206, Theorems 1.2.3 and 1.2.4], we give only a 
brief sketch. For simplicity let us consider part (c); the other parts are similar. 

Suppose first that / is a bounded local function, i.e. that / G BiJl^J^x) where 
diam(X) < D. Then it is easily seen that F(A U A') = 2F(A) whenever A' is a 
translate of A with dist(A, A') > D. [Here it is important that the configuration space 
is a product space, so that arbitrary pairs of configurations in A and A' are compatible 
(i.e. there are no hard-core exclusions). It also seems to be important that A' be a 
translate of A: this guarantees that we can choose configurations in A and A' that give 
Y^ae^Taf and Y^aeA'Taf near-maximum values of the same sign.] Moreover, for any 
two sets Ai, A2 we obviously have \F[Ai) — F[A2)\ < IA1AA2I ||/||oo- From these two 
facts one can prove the van Hove convergence of |A|~^F(A): the idea is to pave a large 
set A by medium-sized cubes (of side a which will eventually go to infinity) separated 
by corridors of width D. See [206, pp. 10-13] for details. The extension to general 
/ G Bqi[ft) is now a routine approximation argument. 

A. 3. 5 The Quotient Seminorm 

In Section 2.4.3 we stated Proposition 2.34 for the case of a compact m.etric single-spin 
space Oo and for a continuous function /. Here we prove a more general result in which 
these two restrictions are lifted: 

Proposition A. 6 (= Proposition 2.34') Let f G B{fl). Consider the following 
properties: 

(a) f has zero mean with respect to every translation-invariant probability measure, 
i.e. Jfdfi = for all fi G Af_|_i^m^,(0). 

(b) f has zero mean with respect to every translation-invariant finite signed mea.sure, 
i.e. Jfdfi = for all fi G Minv{^)- 
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(c' ) f lies in Xj = closed linear span of {/ — Taf: a G Z }. 
(c") f lies in Tb{u) = closed linear span of {g — Tag: g G -B(O), a G Z }. 

= 0. 



(d) limn-' E TJ 

(e) lim |A|-i E TJ 



A/'oo 



aeA 



0. 



Then (a) <^=^ (^6J •i== (c' ) <^=^ (^c"J <^=^ (d) <^=^ (^ej. Moreover, if Oq "s « com- 
pact metric space and f G C{ft), then all these properties are equivalent. [In this 
case property (c) of Proposition 2.34 ^-^ intermediate between (c'J and (c"), hence also 
equivalent.] 

Proof, (a) =^ (b): If // G Afm^,(0), then fi^^fi^ G Minv{^) [otherwise the Jordan 
decomposition of fi into positive and negative parts wouldn't be unique]. So every 
/W G Minv{^) is a hnear combination of two measures in Af_|_i^m^,(0). 

(b) =^ (a): TriviaL 

(c') =^ (c"): TriviaL 

(c") =^ (e): Assume that f = g — Tag with g G B[ft). Then 

\\J2^^f\\°° = \\J2^^9 - J2 ^^5'lloo 

xeA xeA xeA-+a 

< |AA(A + a)|||(/||oo (A.20) 

where A denotes symmetric difference. By Proposition 2.27, |AA(A + a)|/|A| — )• as 
A /* oo (van Hove). This proves the claim for functions / of the given form. The same 
obviously holds for finite linear combinations. It is then routine to pass to norm limits. 

(e) =^ (d): Trivial. 

(d) =^ (c'): hn = f - n~' J2aeCn Taf lies in the linear span of {/ - Taf: a G I"^}, 
and lim^^oo \\hn - /||oo = 0. 

(d) =^ (b): Since // is translation-invariant, //(/) = n~'^ J2aeC'n l^i^af) for all n. 
Hence |/i(/)| < \\fi\\ \\n~'^ J2aec„Taf\\oo- Now let n ^ oo. 

(b) =^ (c) =^ (c"), if Oo is compact and / G C(0): Suppose that / ^ Tc(n) = 
closed linear span of {g — Tag: g G C(0), a G Z }. Then, by the Hahn-Banach 
theorem, there exists / G C(0)* such that / t^c(fi) = and /(/) = I. By the Riesz- 
Markov theorem, / arises from some fi G Af (0) and / t^c(fi) = means precisely that 
/W G Minv{^)- But then (b) implies that /(/) = 0, a contradiction. ■ 

Remarks. I. Variants of this Proposition seems to be well known [198, p. 454] 
(see also [66, pp. 39-40] for a similar argument), but we have not been able to find a 
published proof. See also [313, Exercise 7.2] for a related result. 

2. We do not know whether (a)-(b) are equivalent to (c')-(e) in general; or if not, 
under what minimal extra conditions this equivalence can be proven. For aesthetic 
reasons, if no other, it would be desirable to resolve this question. 
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Next we prove an analogue of Proposition A. 6 in which we quotient out constant 
functions: 

Proposition A. 7 Let f G B[ft). Consider the following properties: 

(a) f has the same mean with respect to every translation-invariant probability mea- 
sure, i.e. Jf dfi = J f dp for all /«, ^^ G Af_|_i^m^,(0). 

(b) f has zero mean with respect to every translation-invariant finite signed mea.sure 
of zero total mass, i.e. Jf dfi = for all fi G Minv{^) satisfying /u(0) = 0. 

(c'J f lies in Xj + const = closed linear span of {/ — Taf: a G Z } and constant 
functions. 

(c" ) f lies in Tb{u) + const = closed linear span of {g — Tag: g G B{fl), a G Z } and 
constant functions. 

(d) hm n-^ E TJ = 0. 

"^°° aeC„ B(a)/const 



(e) hm |A|-i E TJ = 0. 

Then (a) <^=^ (b) -^= (c'J <^=^ (c") <^=^ (d) <^=^ (e). Moreover, if fto is a compact 
metric space and f G C{ft), then all these properties are equivalent. 



Proof, (a) => (b): If // G M^nvi^) with //(O) = 0, then i^+,i^- G M^nvi^) with 
/i+(0) = /U-(0) = A > 0. If A = we are done; if A > 0, apply (a) to the measures 



(b) 



(a): Just apply (b) to fi — p. 
(c"): Trivial. 
(e): Assume that f = g — Tag + c with g G B[ft) and c G R. Then 



xeA 



B{n)/const 



= \\J2T^9 - J2 T^g + c\A\ 

xeA xeA-+a 

< \\J2t^9 - J2 ^^9 

xek xek+a 

< |AA(A + a)|||5||oo . 



B{n)/const 



(A.2i: 



The rest is as in Proposition A. 6. 

(e) =^ (d): Trivial. 

(d) =^ (c'): Let c = lim^^oo n~'^ niid(Eaec„ Taf) as guaranteed by Lemma A. 5(e) 
Then hn = f — n~'^ Haec„ Taf lies in the linear span of {/ — Taf: a (^ Z }, and 

limsup||(/i„ + c) - /||oo = limsup n'"^ ^ Taf - 



It — ^'j^ ^ r^ 

aeOn 

< limsupn"'' ^ Taf 

a£0„ 

= . 



B(n)/const 



(A. 22) 
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(d) =^ (b): A trivial modification of tlie corresponding proof in Proposition A. 6. 

(b) =^ (c) =^ (c"), if Oo is compact and / G C(0): Same as in Proposition A. 6, 
but use the subspace Tc(n) + const in place of Tc(n) ; the signed measure fi will then 
have zero total mass. ■ 

Next we prove a strengthened version of Proposition 2.35. Again we can allow 
an arbitrary (not necessarily compact) single-spin space Oq, and an arbitrary (not 
necessarily continuous or quasilocal) function /. The only subtlety is that in this case 
we must choose the correct definition of X, since (a)-(b) and (c')-(e) are not necessarily 
equivalent. The right definition turns out to be (c')-(e). 

Proposition A. 8 (= Proposition 2.35') Let f G 5(0). Then 

lim \A\-'\\J2TJ 



and 



lim \K\-'r£TJ 



CO 


aeA 


oo 




(A.23a) 




= / B(fi)/J 




(A. 23b) 


nst 


aeA 
~ •' B(Q.)l(I+const) 


B(S 


~l) 1 const 


(A.24a) 
(A. 24b) 



for all closed linear suhspaces X satisfying If C T C Xb(q,)- 

Proof. In Lemma A.5(c,d) we have proven the existence of the limits and their 
equality to the corresponding infima. Now we want to identify the limits with the 
quotient seminorms. 

Let us denote by Lj the limit (A. 23a). Clearly Lj < ||/||oo- Moreover, by Proposi- 
tion A. 6 (c') =^ (e), Lf = Lf, whenever f - f e Tb(U)] hence Lf < \\f\\B{n)/iB^a) ^ 

\\j\\b{u)/i- 

To prove the reverse inequality, note that by an easy corollary of the Hahn-Banach 

theorem [306, Corollary 3 of Section IIL3] there exists / G B[ft)* such that ||/|| < 1, 

Kf) ~ ll/ll_Brfii/j ^^^ l\T = 0. On the other hand, for every / G B[ft)* that annihilates 

X Z) Xj we have 



Kf) 



V aeC'n ) 



< L 



f 



(A.25) 



H 



ence 



B(n)/i — ^}' This proves (A. 23b). 
A completely analogous argument handles (A. 24): it suffices to replace X and Xb{q,) 
everywhere by X -|- const and Xb(q,) + const, respectively. ■ 

Remark. The proof given here of (A. 23) is a slight elaboration of one sketched by 
Hugenholtz [198, p. 454]; by using complete subadditivity we are able to deduce the 
full van Hove convergence. 
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A. 3. 6 Closed and Compact Sets in B^ 

Proof of Proposition 2.39. (a) We shall actually prove something shghtly 
stronger, namely that {$: II^Hb^ ^ M} is closed in the product topology Hxe^ C{ftx) 
(which is weaker than the S° norm topology). So let ($„) be a sequence in {$: II^Hb^ ^ 
Af}, and let $ be another interaction; and suppose that ||($ra)x — ^x\\ -^ for each 
X. (This would occur, in particular, if $„ — )• $ in S° norm.) Then 

M M n z — / V II II Z — / V fj^oQiiv / II 

X30 \^^ I X30 \^^ I 

< l™inf^^||($„)x|| 

= liminf ||$„||b;, 

< M , (A. 26) 



where in the key inequality we have used Fatou's lemma. 

(b) Let ($ra) be a sequence in {$: II^Hb^ ^ M}. Since the single-spin space is 
finite, each space C(Ox), X finite, is finite-dimensional. Therefore, by compactness 
of the ball in C{ftx) together with the usual diagonal argument, we can extract a 
subsequence ($^0 such that ($„')x converges (in || • ||oo norm) for each X, say to $x- 
Let $ = {$x}- In part (a) we have shown that II^Hb^ ^ M. Now we wish to show 
that $„/ — )• $ in S° norm. So fix A^ < oo; we then have 

h(X) < K h(X) > K 

^ E ^A\{^n')x-^x\\ + 'iMiK . (A.27) 

X90 1^1 

h(X) < K 

Since /i ^ 1, the first sum is finite; and since ||($ra')x ~ ^x\\ -^ for each X, we have 

limsup ||$„/ - $||bo < 2MIK . (A. 28) 

n' 

Since K may be taken arbitrarily large, we are done. ■ 

Remarks. 1. For a converse to part (b), see Proposition A. 10 below. 

2. One might ask whether (a) and (b) can be extended to more general closed 
bounded sets in Bh (not just balls). The answer is no, in general: a closed bounded 
convex set in Bh need not be closed (much less compact!) in S°, if h is unbounded. 
Example: Let {A„} be a sequence of finite subsets of Z , in which each equivalence 
class modulo translation occurs at most once, and satisfying lim^^oo h{^n) = +oo. Let 
$„ be defined by 

(^\ _ J l//i(A„) if A is a translate of A„ /. ^„n 

1^ U otnerwise 
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Now, for each sequence X ^ £^ , let $ = J2'^=i ^n^n (this sum is absolutely convergent 
in Bh). Then ||$^||b;, = ||A||^i and ||$^ - ^^'Wb^ = ||A - A'||^i. (That is, A h^ $^ is an 
isometric isomorphism of £^ onto a closed linear subspace of Bh-) Now let S = {$^1, 
and let 

oo 

T = {$\ < A„ < 1 Vn, ^ A„ = 1} . (A. 30) 



n=l 

T is the closed convex hull of S in Bh- Now, II^Hb^ = 1 for all $ G T, so ^ T. On 
the other hand, does belong to the closure of T in S°, since lim^^oo ll^nllB" = 0. 

The natural setting for discussing the spaces S° and Bh is that of weighted £^ direct 
sums of Banach spaces. Let Ki, 1^2, ... be Banach spaces, and let /i: N — )• (0, oo). Then 
we define Yh to be the space of sequences y = (j/i, j/2, . . .), with each yi G l^-, for which 
the norm 

oo 

II?/||y, = J2H^)My, (A.31) 

8 = 1 

is finite. For h = 1 we write Yh = Y. It is easy to prove that all the spaces Yh are 
Banach spaces. The canonical projection pi'. Yh — ^ Yi defined by pi{y) = yi has norm 
l/h{z). 

We then have the following results: 

Proposition A. 9 The closed ball {y: \\y\\Yh ^ ^} ^-^ closed in the product topology 
UY.. 

i 

Proposition A. 10 Let S CY . Then the following are equivalent: 
(a) S has compact closure in Y . 
(h) S is hounded, pi[S] has compact closure in Yi for each i, and 

oo 

lim sup ^ \\y,\\Y, = . (A. 32) 

^^-°« yes ,=jv 

(c) S is bounded, pi[S] has compact closure in Yi for each i, and there exists a function 
/i: N — )• [1, oo) such that lim h[i) = +oo and 

sup||j/||y;j < oo . (A. 33) 

yes 

Note that if Yi is finite-dimensional, then S bounded =^ Pi[S] bounded =^ Pi[S] has 
compact closure in Yi. 

The proof of Proposition A. 9 is completely analogous to that of Proposition 2.39(a). 
Let us sketch the proof of Proposition A. 10: 
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(a) =^ (b): Let S be compact in Y. Then clearly pi[S] D Pi[S] is compact in 
Yi. Moreover, for each e > there exists a finite set y^^\...,y^^' G Y such that 

S C [j B{y^^\e). It follows that 



fc=i 



sup Y. \\y^\\y^ ^ ^ + ,n?,^/ H hi h ■ (A. 34) 



Taking A^ — )• oo, we get 



limsup sup ^ llj/illy, ^ £ • (A. 35) 



Since e was arbitrary, the proof is complete. 

(b) =^ (c): Choose A^i < A^2 < • • • such that 



Now define 



Then, for all j/ G S*, 



sup E ll?/»-|k. < 3"" . (A.36) 



^^^^ = ^ 2- for V! < z < A^.,, (^-3^) 



oo Ni—1 oo -^Vm-i-i— 1 

Ij/IIy;, = I]^(OII?/«-lk» = I] ll?/«-lk» + E 2" Y. h^h 

i=l i=l 7n = l i=Nm 



Ni — 1 oo ^ r) 

< J2 \\y^h + J2 (o 

i=l m = \ ^'^ 

< ||?/||y + 2. (A.38) 

Since S is, by hypothesis, bounded in Y, this proves the claim. 

(c) =^ (a): The proof is essentially identical to that of Proposition 2.39(b). ■ 

To apply this to our statistical-mechanical setup, let (Xi) be a sequence of nonempty 
finite subsets of Z in which each equivalence class modulo translation is represented 
once and only once. Setting Yi = C(Ox, ), it is easy to see that S° and Bh are isometric 
to the direct-sum spaces Y and Y/^, respectively. Therefore, Proposition A. 10 (a)^=^(c) 
tells us that any compact subset of S° is contained in the ball {$: II^Hb^ ^ M} for 
some h ^ 1 and some Af < oo. A similar result can be found in [208, Lemmas 1 and 
2]. 

Proof of Proposition 2.43. This is an immediate consequence of Proposition 
2.39(b), together with the following well-known fact: if A and B are subsets of a Banach 
space X, with A compact and B closed, then A-\- B is closed. ■ 
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A. 3. 7 Physical Equivalence 

Here we prove Theorem 2.42 on the equivalence of the two notions of physical equiv- 
alence (DLR and Ruelle). Since both senses of physical equivalence are statements 
about the difference $ — $', it suffices to consider the case $' = 0. 

Proof of Theorem 2.42, DLR =^ Ruelle. We wish to measure how strongly 
Hf{uj/^,uj/^c) depends on uj/^c Let us therefore define the oscillation of Hf with respect 
to LOf^c by 

oscAc(i7^) = sup \Hl{u) - Hl{u')\ 

C^A = ^A 

= sup \Hf{ujA,ojAc) - Hf{ujA,oj'j^c)\ . (A. 39) 

Considering now the definition Hf{uj) = Z]^.^nA5^0 ^-4(^)5 i^ i^ easy to see that 
oscA':{Hf) gets contributions only from sets A that intersect both A and A^, so that 

oscAc{Ht) < 2\\WIaA\oo- (A.40) 

In particular, for $ G S^ we have 

oscAc{Hf) < o(|A|) as A / 00 (van Hove) , (A.4I) 

by (2.62a). 

Suppose now that $ is physically equivalent to in the DLR sense, i.e. that Hf 
is jFAc-measurable for all A. (Actually, it suffices to assume this for some van Hove 
sequence of sets A.) Then Hf{ujA,'^A'') is independent of uja, so oscA':{Hf) is equal to 
the unrestricted oscillation 

osc{Hf) = sup Hf-MHf (A.42) 

— '2\\HA\\B{n)/const ■ (A. 43) 

Combining (A. 41) and (A. 43), we conclude that 

\\Ht\\B{a)/const < o{\A\) as A / TO (van Hove) . (A.44) 

By Proposition 2.45(c), we conclude that ||$||B''/(j"+Corast) = 0, i.e. $ G JT" + Const — 
that is, $ is physically equivalent to zero in the Ruelle sense. ■ 

Proof of Theorem 2.42, Ruelle =^ DLR. Suppose that the single-spin space 
Oo is a standard Borel space (e.g. a complete separable metric space), and that $, $' G 
B^ are physically equivalent in the Ruelle sense. Then by [157, Theorems 4.22 and 5.19 
and the comments after them], there exists a translation-invariant Gibbs measure for 
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$, call it fi. By Corollary 2.68, fi is an equilibrium measure for $. By Proposition 2.65, 
fi is an equilibrium measure also for $'. By Corollary 2.68 again, // is a Gibbs measure 
for $'. But then Corollary 2.18 implies that $ and $' are physically equivalent in the 
DLR sense. ■ 

Remark. The proof given here of Ruelle =^ DLR is aesthetically unsatisfying: 
the two notions of physical equivalence are statements purely about interactions and 
Hamiltonians, so there ought to be a purely "algebraic" proof of their equivalence 
involving only these concepts^ without dragging in the whole theory of equilibrium 
measures, Gibbs measures and their equivalence. In particular, it is galling to have to 
assume that Oq is a standard Borel space, for a result that obviously has nothing to 
do with topology. However, we have been unable to find such an algebraic proof; we 
hope that some reader will do so. 

A. 3. 8 Estimates on Hamiltonians and Gibbs Measures 

In Section 2.4.5 we stated Proposition 2.40 for the case of a compact metric single-spin 
space. Here we prove a more general theorem in which this restriction is removed. (We 
still consider only continuous interactions and functions, but that restriction too could 
be removed if we really cared.) 

Proposition A. 11 (= Proposition 2.40') The map [$] i-^ [/$] is an isometry of 
B°/J onto Cgi{n)/Igi, and ofB°/{J + Const) onto Cgi{n)/{Igi + const). Here Igi = 

inCgiin). 



Proof. It is convenient (following Ruelle [313, Section 3.2]) to introduce the modified 
observable 

n = J2 ^x. (A.45) 

where X 3mid denotes that is the \_{\X\ + l)/2j*'' element ("middle element") of 
X in lexicographic order. Clearly /$ — /4' G X^;. The advantages of /4' are due to the 
following easily verified facts [313, p. 37]: 

a) 1/4': $ G Bfimte} = Cloc{ft). 

b) 1/4': ^eB^} = Cgi{n). 

c) For all / G C,;(0), 

11/11 = inf ||$||bo . (A.46) 

Moreover, for / G Cioc{^) there exists a $ G B finite that attains this minimum. 
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In particular, the map $ i— )• [/$] = [f^] is onto Cqi{ft)/Xqi. 
Now, from ||/$||oo < II^IIb" we easily deduce that 

ii[/c^]iic(Q)/x < \mho/j. 



(A.47) 



To prove the reverse inequality, note that by Proposition A. 8 we have, for any / G 



Now, by property (c) above, for each 



C,/(0), ||[/]||c(Q)/x = limA/oo |A|-^ E TJ 

aeA 

A and each e > we can choose ^ G S° such that f^ = |A|~^ E Taf and H^Hb" ^ 

aeA 

A|~^ E Taf + e. (In particular, we have [/] = [f^] = [f^s]-) Then, by taking 
A /* oo and e J, we conclude that for all / G Cg;(0), 



c(Q)/i > inf II^IIb" 

^ >' *eBO: [/4']=[/] " 



In particular, taking f = f^^ we get 

||[/c^]||c(Q)/X > \m\\BO,J. 



(A.48) 



(A.49) 



This proves that the map [$] i— )• [/$] is an isometry of B^ j J into C{ft)/X. 

Repeating the same argument with / replaced by / + c, and then optimizing over c, 
we conclude that [$] i— )• [/$] is also an isometry of S°/(j7+ Const) into C(0)/(X+const). 



Proof of Proposition 2.44. 

(a) is easy and well known: see [206, p. 9]. 
(d) By definition we have 



H 



A, free 



XCA 



anc 



xeA xeAX30 

xeAY3x 

v-l^nAI 

= > -^ — r-^^Y 

\Y\ 



(A. 50) 



(A.5i: 



(the double sum is absolutely convergent and hence can be rearranged freely). Thus 

^ IxnAl 

X n A= ^ 



Hkjree Z^ Tr^fq, 



\X\ 



-$v . 



(A.52) 
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Taking norms, we have 



I^AJree - XI ^^/*IU < Yl 1^1 II^^IU 

X n A= ^ 

= E E i^r^ii^xiioo 

xeA X 3 X 

X nA= ^ 

= E E i^n^^ 

(y + a;) nA= ^ 



y90 I-* I 



Now divide by |A|: 



-IN ZTC^ V- ^ . N / V- K^' -Y)nA\\\^y\\ ^ 

lAI IFI 



xeA Y30 



This sum is dominated uniformly in A, since |(A^ — Y) Cl A|/|A| < 1 and $ G S°. On 
the other hand, for each fixed finite set K, we have 

|(A--y)nA| ^ v^ |An(A--y)| 

|A| - ^ |A| 

yeY I I 

which tends to zero as A /* oo (van Hove). Hence, by the dominated convergence 
theorem, (A. 54) tends to zero as A /* oo (van Hove). 

(b) and (c) are immediate consequences of (d) together with Propositions A. 8 and 
A. 11. ■ 

Remark. See [313, p. 41] for an alternate proof of (c), carried out first for $ G Bfinite 
and then extended to S° by density. 



Proof of Proposition 2.45. 

(a) is easy and well known: see [206, p. 14] or [157, p. 29]. 
(d) By definition, 

Ht - Htj,^^ = Wl^. = E ^x • (A.56) 

Xn A ^ 
X n A= ^ 
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Taking norms, we have 



Xn A ^ 
X n A= ^ 



X oo 



< E E 11^ 

2^eA X 9 a; 

X nA= ^ 

= E E 

(y + a;) nA= ^ 

= El(^'-^)nA|||$y||oo 

Y30 



|$y| 



(A.57) 



The remainder of the argument is completely parallel to the proof of Proposition 
2.44(d), but using $ G S^ rather than $ G B°. This proves (2.62a). 

As for (2.62b), the leftmost term is o(|A|) as an immediate consequence of (2.62a) 
and (2.58). The middle term is proven to be o(|A|) in [157, pp. 320-321]. (That proof 
is stated only for cubes, but it is valid for arbitrary van Hove sequences.) 

(b) is then an immediate consequence of (2.62a) and (2.56). (c) is likewise an 
immediate consequence of (2.62a) and (2.57). ■ 



Proof of Proposition 2.46. Let // be a Gibbs measure for an interaction $ G S^ 
and a priori measure /i°. Then the DLR equation (2.22) states that 






[^A, 



/ diJ.{T) Zf{TAc) ^ exp[-Hf{(jjA X tac)] 



where 



ZtiTAc) = / exp[-Hf{LOA X tac)] Y[ dfil 



iOx 



(A.58) 



(A. 59) 



Now, by Proposition 2.45(d), we can replace Hf{uj/^ X tac) everywhere by Hf^ ^^^{iOjC)^ 
incurring an error which is o(|A|) uniformly in u and r. Therefore, 

< o(|A|) . (A.60) 



log ll + <,.ee + log Z^ 



A, free 



But ||i7|^^,,,-E.eAr.M|oo<o(|A|) by Proposition 2.44(c), and I log Z|^^,,,-|A|p($|/)|< 
o(|A|) by Proposition 2.58(a). Hence 



logy^ + E^^/^ + IAK^I/: 



< o(\A\ 



(A.6i: 
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In particular, 



^°^li\?/^^^ 



< oflAI 



(A.62) 



C(n)/const 

This proves (2.63). This bound is uniform for all ji G ^(H*). 

Now let /ii (resp. IJ.2) be Gibbsian for interactions $1 (resp. $2) in ■S^, with the 
same a priori measure /i°. Combining (A. 61) for the two cases, we get 



log 



J/i 



lA 



dji 



2A 



^r./c,_c, + |A|b($i|/)-p($2|/)] 



+ o(|A|) . (A.63) 



But 



7 . -LxJ^i-^2 

xeA 



lAI ||$i - $ 



2\\b°/j 



+ o(\A\ 



by Propositions A. 8 and A. 11, while 

p($l|/)-p($2|/)| < ||$1 -$2||bo/^ 

by Propositions 2.56(d,e) and 2.58(a). Hence 



log 



dfj. 



2A 



< 2|A| ||$i -<^2\\bo/j + o(|A| 



(A. 64) 



(A. 65) 



(A. 66) 



But by Propositions 2.56(c) and 2.58(a), the right-hand side of (A.63) is unchanged if 
we replace $1 by $1 + ^ with ^ G Const (i.e. if fq, G const). Thus, in (A. 66) we can 
replace the B^ j J norm by B^ j {J + Const). This proves (2.64). 

In a similar way we deduce (2.65) from the two cases of (A. 61) together with (A. 64). 



Remarks. 1. We wish to emphasize that (2.65) is an equality. This fact plays a 
crucial role in our proof of the Second Fundamental Theorem (Section 3.3). 

2. The proofs of Propositions 2.56 and 2.58 do not use these estimates, so the 
reasoning is not circular. 

A. 4 Proofs and References for Section 2.5 

Proposition 2.51 is easy to prove: see e.g. [206, Lemma 1.2.2]. 

Proof of Proposition 2.53. 

(a), (b), (c) and (g) are proven in [157, Proposition 15.5]. 

(d) is a trivial generalization of what is proven in [157, Proposition 15.14(1)]. 

(e) is proven for the bounded measurable topology in [157, Corollary 15.7 and 
proof of Proposition 15.14(2)]. For the weak topology, see [206, pp. 42-43]; though 
stated there for compact metric spaces, the proof is in fact valid for arbitrary complete 
separable metric spaces. See also [157, p. 316]. 
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(f) We know from part (e) that {//: I[fi\p) < c} is closed in the bounded mea- 
surable topology. In [157, proof of Proposition 15.6] it is shown that the densities 
{dfi/dp: I[fi\p) < c} are uniformly zz-integrable (see also [73, Theorem 11-22]); this 
implies, by the Dunford-Pettis theorem, that {//: I[fi\p) < c} is relatively compact 
and relatively sequentially compact in the bounded measurable topology ([73, Theo- 
rem 11-25] or [272, Proposition IV-2-3]). Since the weak topology is weaker than the 
bounded measurable topology, the last statement is an immediate consequence. 

(h) is proven in [102, Theorem 2.1 and Lemma 2.3]. 

(i) is an abstraction of the usual statement of strong superadditivity [157, Proposi- 
tion 15.10]. ■ 

Remarks. 1. Statement (d) is not true jointly in fi and p. Counterexample: Let 
= {a, 6}, /ii = z/2 = 6a, fi2 = 1^1 = h-, Ai = A2 = \. Then /(/ii|z/i) = /(/U2k2) = +00, 
while /(|/Ui + \lJ.2\\vi + \v2) = 0. 

2. For some improvements of (d) if the fii have "almost disjoint" supports, see [26, 
Proposition 5.1 and Corollary 5.2] and [326, Theorem 2.1]. 

3. If the cr-field E is countably generated, then the set {//: I[fi\p) < c} is in fact 
compact and metrizable in the bounded measurable topology: this follows from [73, 
Theorem 11-24]. 

4. Additional useful properties of the relative entropy are given in [157, Proposition 
15.6 and Corollary 15.7]. 

The finite- volume variational principle (Theorem 2.54) is well known: see e.g. [206, 
p. 46] or [99, Lemma 2.1]. 

A. 5 Proofs and References for Section 2.6 

A. 5.1 The Infinite- Volume Limit: Proofs 

Proof of Proposition 2.56. For all but part (e), see [206, Theorems 1.2.3 and 
1.2.4]. Part (e) is an immediate consequence of Proposition 2.34(e). ■ 

Proposition 2.57 is [313, Proposition 4.4]. Proposition 2.58 is an immediate conse- 
quence of Propositions 2.56 and 2.57 together with the estimates (2.58) and (2.62b). 

Proof of Proposition 2.59. When ;/ is a product measure, this is [157, Corollary 
16.15(b)]. When ;/ is a Gibbs measure, this follows from the product-measure case 
together with (2.90). ■ 



Proof of Proposition 2.61. 

The existence of the van Hove limit, and its equality to the supremum, both follow 
from the strong superadditivity of /a(/u|z^) as a function of A, when ;/ is a product 
measure [Proposition 2.53(i)]. One way to see this is to note that strong superadditivity 
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implies complete superadditivity (Lemma A. 2); the claim then follows from Proposition 
A. 3. Alternatively, one can make a direct argument using the strong superadditivity 
[206, Theorem 11.2.2]. 

The affineness is an immediate consequence of Proposition 2.53(c,d), and the lower 
semicontinuity is an immediate consequence of Proposition 2.53(e) and equation (2.93b); 
see [206, Theorem II. 2. 3] or [157, Proposition 15.14]. 

The proof of (d) employs the following construction: Pave Z by a cube C„ and its 
disjoint translates. Now, given a translation-invariant measure //, let pn be a measure 
which equals // when restricted to each of these cubes, and in which the copies of the 
spins in the various cubes are rigidly forced to be equal. Then let //„ = n~'^ J2aec„ TaPn- 
By construction //„ is translation-invariant; and with a little work one can prove that 
i{pn\i^) = i-max- On the other hand, it is easy to see that lim^^oo /"n = lini„^oo Pn = fJ- 
in the bounded quasilocal topology. 

(e) is proven in [206, Lemma IV. 3. 2]. 

When Oo is a standard Borel space (e.g. a complete separable metric space, or a 
Borel subset thereof), the compactness in the bounded quasilocal topology is proven 
in [157, Proposition 15.14(3)]. (We do not know whether the result is true for more 
general spaces Oq-) Since the weak quasilocal topology is weaker than the bounded 
quasilocal topology, the last statement is an immediate corollary. ■ 

Proposition 2.62 is proven in [157, Theorem 15.30(b)]. 

Remark. FoUmer [122] has given a beautiful formula for i{p\i') in terms of the 
relative entropy [not relative entropy density]) of the conditional distributions of // and 
V given the lexicographic past. See also [157, Proposition 15.16 and Theorem 15.20]. 

Theorem 2.63 is essentially [157, Theorems 15.30(b) and 15.39]. 

Remark. A rather weak converse to Theorem 2.66 is the following: Let p\,p2 £ 
Mj^\^inv{^) with p2 Gibbsian for $2 £ ■S^, i{f'i\f'2) ^ K and pi ergodic. Then there 
exists an interaction $1 G B^ [not B^\) with ||$i — $2||bo ^ Kl2 such that pi is 
an equilibrium measure for $1. This can be proven using the Bishop-Phelps theorem 
[206, Corollary V.2.1]. The same is true if pi is a finite convex combination of ergodic 
measures, but then the constant K l2 is replaced by a worse one. 

Theorem 2.67 is proven in [157, Theorem 15.37]. The proof is given there for a 
sequence of cubes, but the same proof works for an arbitrary van Hove sequence. 

Proof of Corollary 2.68. Q^nv{^'^) + 0, so let v g Q^nv{^^) and use (2.110). 
Then Gibbs ==^ equilibrium is Theorem 2.66, and equilibrium ==^ Gibbs is Theorem 
2.67. ■ 
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A. 5. 2 The Infinite- Volume Limit: Counterexamples 

As mentioned in Sections 2.6.1 and 2.6.2, the existence of the hmits defining the infinite- 
volume pressure p{f\i') and the infinite- volume relative entropy density i{fi\i') is a 
highly nontrivial problem: contrary to what might be supposed at first glance, these 
limits do not always exist. The first counterexamples bearing on this problem are due 
to Kiefi'er [216]. Here we give a simplified version of Kieffer's counterexample, due to 
Sokal [331]: 

Let = { — 1,1}^. Let Pn be the measure which gives weight l/2n to each of 
the periodic sequences of period 2n consisting of n I's followed by n — I's. Let p be 
the convex combination Y,'^=i o^n^n- We shall show that for a suitable choice of the 
coefficients {a„}: 

( ^ \ 

(a) For the function f{io) = cuq, the pressure lim k ^log/exp Xl ^i dvi^u) does 

not exist. 

(b) For the measure // = (5_|_ = delta measure concentrated on the sequence of all 
-|-l's, the relative entropy density lim A;~^/ji^,,,^/;i(/i|z/) does not exist. 

( ^ \ 

Proof of (a). Let gn{k) = /exp J2 coi dvn{to). It is easy to see that </„ is a 
periodic function of period 2n, and satisfies the (crude) bounds 

— e^"^'^) < gn{k) < e^"^'^) , (A.67) 

2n 

where 

Fn{k) = n - |A;(mod2n) -n| (A. 68) 

is the sawtooth function taking the value at A; = 0,2n,4n,... and the value n at 
k = n, 'in, 5n, .... Hence 



9{k) 



exp^cuj dv{Lo) = ^a^gnik) 



> y^ On ^F„(k) 



oo 



2n 






(A. 69) 



Now choose the sequence {a„} to have huge gap 



s: 



a„ = const X ( ^"""^ ^^ ^ = 2' ^^ ^ome integer / ^^^q^ 

I otherwise 

where a > will be chosen later. Then for k = 2' we have the lower bound 

g{k) > a,g,{k) > ge^'=W = ^ e^'^^' (A.71) 
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and hence 



liminf-log5(2') > 1-a. (A. 72) 



On the other hand, for 2' < k < 2'"'"^ we have the upper bound 



uppe 

2' oo 



g{k) < Y.ane'' + ^ a„ e'^ 

n=l n=2' + ^ 

[using Fn{k) < min(n, k)] 



e' 



\n=l J \n=2' + ^ I 

< e'' + const X e-"^'^^'^ 

[const depends on a only] 

< const X exp[max(2', k - a2'+^)] . (A. 73) 

Defining /3 = k/2' (so that 1 < /3 < 2), we find 

1, ,,, const (1 2a\ /. „ x 

^log<,(t)<^ + max(-,l--j. (A.74) 

Now choose any < a < |. Then max(l//3, 1 — 2a//3) is minimized at /3 = /3* = 2a + 1 
(which satisfies 1 < /3* < 2) and takes the value l/(2a + 1) there. By choosing 
k = [2'/3*J and letting I —■ oo, we conclude that 

Since l/(2a + 1) < 1 — a when < a < |, it follows from (A. 72) and (A. 75) that 
lim k~^ log g[k) does not exist. ■ 

Proof of (b). It is easy to see that 

h{k) = I{i^,„^k}{6+\v) = -logv{uJi = ... = Uk = +1) 

- n-k+1 

= -logl^fln • A. 76 

2n 

n=k 



Let us again take 



const X ("""'' if n = 2' for some integer / ^^^^^ 

otherwise 



Then for k = 2' we have 



h{k) < -log^ = aA; + log(2A;) . (A. 78) 
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On the other hand, for 2' < k < 2'"'"^ we have 

2" - A; + 1 



oo 

-a2" 



h{k) = -log ^ e- 

m = l+l ^ 



> -log E «-"'" 



m = l+l 

> const + a2'+^ (A. 79) 

[const depends on a only] 



Thus 



1-^V 



limsup— /i(2') < a (A. 80a) 



1 



2 



liminf— ^/i(2' + l) > 2a (A. 80b) 

i^cx> 2+1 

So for any a > we conclude that lim k~^h[k) does not exist. ■ 

We note also that Varadhan [359] and Newman [275] have given an example of a 
mixing Gaussian process for which the pressure does not exist. 

B Low- Temperature Phase Diagrams and Pirogov- 
Sinai Theory 

B.l Generalities on Phase Diagrams 

The central problem in equilibrium statistical mechanics is the description of the set 
of Gibbs measures for a given interaction. More generally, families of interactions 
(or of specifications) are considered, with members labelled by certain parameters: 
inverse temperature^^ /3, magnetic field, chemical potential, etc. The ultimate goal 
is then to describe the set of Gibbs measures, in particular the number of extremal 
Gibbs measures, as a function of these parameters. The partition of the parameter 
space into regions with different numbers of extremal Gibbs measures is called a phase 
diagram of the family of interactions, and the manifolds delimiting such regions are 
called phase-transition manifolds . 

A natural approach to the difficult problem of determining the full phase diagram 
is to fix first some of the parameters so that the resulting "restricted" phase diagram 
is amenable to a comparatively simple analysis. Then, one studies whether this phase 
diagram is "stable", that is, whether a small change in the fixed parameters produces 



^®As we want to explicitly discuss the role of this parameter, throughout this appendix we un-absorb 
(3 from interactions and Hamiltonians. 
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only a small deformation of the diagram keeping unaltered the main properties of the 
extremal Gibbs measures. 

The most widely used "restricted" phase diagrams are the high-temperature (/3 = 0) 
and low-temperature (/3 = oo) limits. In the former, the situation is particularly simple: 
The finite- volume Gibbs distribution (2.20) becomes for /3 = just the product measure 
YlxeA d'lJ^x independently of the boundary condition. Hence, there is a unique Gibbs 
measure, namely /i° = Ylxec^lJ^x^ which corresponds to independent spins, the one at 
site X distributed according to the a priori measure /i°. [Note that for translation- 
invariant Gibbs measures the same conclusion follows from the variational principle 
(2.105a), which for /$ = requires z(/i|/i°) = inf z(- |/i°) = 0, hence // = /i°.] It is 
well known that this infinite-temperature phase diagram is stable in a suitable space 
of interactions: for fi small the Gibbs measure remains unique and it corresponds to 
weakly dependent spins. This has been proven for lattice-gas [140] or, more generally, 
spin-1/2 [205] interactions in B^ , and for general interactions in B^ [84, 177]. It is not 
known whether it is true for general interactions in B^ . 

The phase diagram for the zero-temperature limit is, in general, more complicated 
to describe; its stability is the subject of Pirogov-Sinai theory. In this appendix we give 
a brief overview of the conclusions of this theory with an eye on the applications needed 
in Section 4. Its understanding requires, of course, a proper grasp of the basic notions 
involved in the construction of zero-temperature phase diagrams. As remarked already 
in the seminal work of Ruelle [311], the formalism for zero-temperature statistical 
mechanics has some important differences with the one for finite temperatures reviewed 
in Section 2. Moreover, the nomenclature adopted throughout the existing literature is 
often a source of confusion, with different authors assigning different meanings to the 
same words. Therefore, for the convenience of the reader and to fix the terminology, we 
start with a review of the zero-temperature formalism. For this part of the appendix, 
the reference closest to our needs — and from which we have taken many of the ideas 

— is the review by Dobrushin and Shlosman [95]. However, for the sake of consistency 
with the rest of our work, we adopt a nomenclature slightly different from theirs. We 
shall parenthetically contrast these differences both for the benefit of the reader familiar 
with [95] and as a token of the confusing state of the nomenclature. 

Let us state once and for all that in this appendix, we consider only the case of 
periodic interactions and finite single-spin space, i.e. |0o| finite. Moreover, except in 
Sections B.2.9 and B.4.4, the interactions are assumed to be of finite range. 

B.2 Zero-Temperature Lattice Systems. General Formalism 

Heuristically, as /3 — )• oo only configurations with minimal energy "survive" , the others 
being exponentially damped by the Boltzmann factor. However, in the general theory 
of zero-temperature statistical mechanics — as in statistical mechanics quite generally 

— the central objects are not individual configurations but rather probability mea- 
sures describing a random distribution of configurations [303]. Just as for non-zero 
temperature, such measures can be defined either via specifications or via a variational 
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principle. 

B.2.1 Zero-Temperature Gibbs Measures 

Let us start with the approach based on specifications. We see that the /3 — )• oo hmit of 
the finite- volume Gibbs distribution (2.20) with a fixed boundary condition produces a 
measure concentrated on the configurations of "minimal energy" for the given boundary 
condition and giving equal probability to each such configuration. That is, for any 
interaction $, any finite volume A and any boundary condition r G Oa^, we have 



/3$^/,^^ _ /^a(^ n j^a.t; _ $,T=0^ 



lim vr-(A) = '- ,^, r ^ TTiriA) (B.l) 



for all sets A G J^a, where 0^^ is the set of configurations uj^ in A minimizing the 
energy Hf{ujA X tac): 



^A,T = 1 ^a: ^a (^a X ^aO = ^inf i?* (^a x tac) \ . (B.2) 

We call n*'"^~° = (tt^'^" )Aes the zero-temperature specification (or ground-state spec- 
ification [95]) for the interaction $. 

Definition B.l A zero-temperature Gibbs measure /or $ is a measure consistent with 
the specification (B.l). 

We remark that the specifications (B.l) are quasilocal (since we only consider finite- 
range interactions), but not uniformly nonnuU, hence they are not Gibbsian. Therefore, 
zero-temperature Gibbs measures happen not to be honest Gibbs measures. In fact, 
the possibility of including (B.l) in the general framework is one of the advantages 
of introducing the general notion of specification (Section 2.3.1), rather than just the 
more restricted (and popular) class of Gibbsian specifications (Section 2.3.2). The 
zero-temperature Gibbs measures for a given interaction $ form a (weakly) closed 
— hence (weakly) compact — convex subset of the compact metric space Af_|_i(0) C 
Af(0). Therefore, by Choquet's theorem [293] any such measure can be written as the 
barycenter of a probability measure concentrated on the extreme points. In fact, the 
general theory of specifications guarantees us that this decomposition into extremal 
measures is unique [157, Theorem 7.26], i.e. that the set of zero-temperature Gibbs 
measures is a simplex. 

B.2. 2 Ground-State Configurations. Support Properties of Zero-Temperature 
Gibbs Measures 

The specifications (B.l) satisfy 

TrA{oJA\uAc) = unless cua G 0| ^ (B.3) 
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This property implies that the zero-temperature Gibbs measures — which satisfy //tta = 
fi for every finite set A — are supported by the set of configurations io such that 
c<^A £ ^Auj c foi" Gvery finite A, i.e. is of configurations that minimize the local energy 
when they themselves are the boundary condition. Configurations with this property 
are called ground-state configurations. By (B.2) they can be characterized as those 
configurations whose energy cannot be lowered by any change involving only a finite 
number of spins. That is, cu G is a ground-state configuration for an interaction $ if 
and only if for every A and every configuration u' such that Uf^c = cu^c , we have 

Hl{J)-Hl{u)^ Y: [$4^0 - $aH] > . (B.4) 

Acs 
AnA^0 

The set of ground-state configurations is closed (hence compact) because the condi- 
tions (B.4) involve finite- volume Hamiltonians which are continuous functions of the 
configurations. This fact of being a closed set justifies the use above of the expression 
"is supported by" (= "its support is a subset of"). We recall that the support of a 
measure fi is the smallest closed set of full measure (Section 2.1.3). 

The fact of being supported on configurations satisfying (B.4) is not equivalent to 
being consistent with the specifications (B.l) — it is weaker. The more general mea- 
sures characterized only by this support property turn out to play an important role 
in the study of the stability of zero-temperature phase diagrams (Theorem B.12 be- 
low). Inspired by [95], we call these measures w- (for weak) zero-temperature measures. 
Formally: 

Definition B.2 A w- zero-temperature measure for an interaction $ is a measure fi 
satisfying 

//({ground-state configurations for $}) = 1 . (B-5) 

In Section B.2. 7 we discuss a natural limit process that produces w-zero-temperature 
measures, and we present an example (for the Ising antiferromagnet with a magnetic 
field) in which this limit process produces a translation-invariant w-zero-temperature 
measure which is not a zero-temperature Gibbs measure. 

Obviously the set of w-zero-temperature measures for a given interaction $ is 
(weakly) closed — hence (weakly) compact and convex. The extreme points are simply 
the delta measures 6^^ concentrated on a single ground-state configuration io. This set 
is therefore trivially a simplex. 

The previous discussion can be summarized in the following way: 

Tlieorem B.3 Every zero-temperature Gibbs measure is a w-zero-temperature measure 
for the corresponding interaction, i.e. it satisfies (B.5). 

This theorem constitutes the precise version of the idea that only configurations with 
minimal energy "survive" at zero temperature. 
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B.2.3 Rigid Ground-State Configurations 

The set of ground-state configurations is in general rather large. Already the ferro- 
magnetic Ising model provides a rich illustration. This model has exactly two periodic 
(in fact translation-invariant) ground-state configurations: the all-"-|-" and the all-"—" 
configurations. But in addition it presents infinitely many non-periodic configurations 
exhibiting interfaces between "-|-" and "— " spins. In all dimensions we have the fiat- 
interface configurations: 

f +1 for xi > . . 

^^ = I -I for X, < (^-^^ 

(and translated, 90°-rotated and I80°-rotated versions of this). In higher dimensions 
we have a growing zoo: For dimensions d > 2 we have configurations with interfaces 
in the form of staircases; for d > 3 there appear configurations resembling "books on 
a table" or "books on a staircase" [95]. See this last reference for a partial catalogue. 
Not all these configurations are equally relevant for zero- and low-temperature phase 
diagrams. We can distinguish three mutually exclusive categories roughly representing 
different (for us decreasing) levels of relevance. We shall call them rigid, convivial 
and superfluous . The rigid configurations are usually the most important ones (albeit 
not the most numerous); they are associated to deterministic zero-temperature Gibbs 
measures: 

Definition B.4 For a given interaction $, a ground-state configuration u is called 
rigid [8] if the measure 6^^ concentrated on u is a zero-temperature Gibbs measure for 
$, i.e. is consistent with the specification (B.l). 

A simple calculation proves the following: 

Proposition B.5 A ground-state configuration u is rigid if and only if 

|f^wl = l (B.7) 

for all finite A. 

In words, this theorem says that u placed as a boundary condition determines uniquely 
the minimal-energy configuration inside any given volume (thereby justifying the qual- 
ifier "rigid"). Equivalently, any local change of cu produces a strictly positive change of 
energy. Usual phase-diagram studies — in particular Pirogov-Sinai theory — deal only 
with these deterministic zero-temperature Gibbs measures and their low-temperature 
perturbations. ( Warning: Reference [95] reserves the name "ground-state configura- 
tions" only for the rigid ones.) 

For the Ising model (ferromagnetic, zero magnetic field), it is clear that the all-"-|-" 
and all-"—" configurations satisfy (B.7) and hence they are rigid in any dimension. The 
case of the non-periodic ground-state configurations (fiat-interface, staircase-interface, 
etc.) is more delicate. There is, however, a simple argument [95] showing that if u 
is a ground-state configuration for the J-dimensional Ising model, then its cylindrical 
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extension to an extra dimension — defined by iO(xi,...,xa,xa+i) — '^{xi,...,xa) — i^ ^ rigid 
ground-state configuration for tlie [d-\- f )-dimensional Ising model. Indeed, if we tliink; 
of tlie extra dimension as "vertical", any local change of uj consists of a finite stack 
of local changes of io. The bottom and top J-dimensional sections of this stack face 
sections where the configuration is equal to io without changes. Thus, some of the 
corresponding "vertical" bonds join antiparallel spins, which produces a strictly positive 
contribution to the change in energy. This proves (B.7) and hence the rigidity of uj. 

As a consequence of this argument, we conclude that the fiat-interface configurations 
are rigid for d > 2^ the staircase-interface ones are rigid for (/ > 3, and so on. The proof 
that the rigidity does not extend below such dimensions requires further arguments. 
We shall comment on this below. 

Remark. Rigidity of a ground-state configuration io does not exclude its belonging 
also to the support of some zero-temperature Gibbs measure that is not deterministic. 
For instance, if there is more than one rigid configuration, one can of course take 
convex combinations of the corresponding delta-measures. The possibility of a less 
trivial example will be discussed below, after (B.IO). 

B.2.4 Convivial Configurations. Zero-Temperature Entropy 

However, not all is deterministic in zero-temperature life. Our next type of configura- 
tions are those that belong only to the support of a non-deterministic zero-temperature 
Gibbs measure. We recall that the support of a measure is the complement of the union 
of all the zero-measure open sets (that is, the smallest closed set with full measure). 

Definition B.6 For a given interaction, a ground-state configuration u is called con- 
vivial if 6t_^ is not a zero-temperature Gibbs measure but there exists a zero-temperature 
Gibbs measure having u in its support. 

These ground-state configurations, which individually have little or no weight but are 
relevant as an ensemble, and the associated non-deterministic Gibbs measure supported 
on such an ensemble, are probably not what the physicist-in-the-street has in mind 
when thinking about zero temperature. One expects them in cases where there is a large 
degeneracy in the ground state. The precise concept measuring such degeneracy is the 
zero-temperature entropy (also called residual entropy). For the sake of completeness, 
we briefiy review the definition and principal properties of this quantity. Our main 
reference is the classic article by Aizenman and Lieb [8]. 

There are some subtleties involved in the right notion of zero-temperature entropy. 
Heuristically, its computation requires a limit process: one must compute (or measure) 
a sequence of low-temperature entropies and take the limit as the temperature goes 
to zero. The so-called "third law of thermodynamics" claims that such a limit must 
be zero; such behavior is indeed seen in simple models, but not always. Its viola- 
tion must be interpreted as signaling a large "degeneracy of the ground state". The 
formalization of these ideas requires a consideration of the role of the infinite- volume 
limit. As pointed out by some authors (see references in [8]), the volume must be 
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sent to infinity before taking the limit T — )• 0. But in this case, one must consider 
with some care the boundary conditions. If, motivated by the "ground-state-energy" 
(= variational) approach [see eq. (B.I4) below], one works with pre-fixed — for in- 
stance free — boundary conditions, then there are examples where the contribution of 
some excited configurations survives the zero-temperature limit, so that the residual 
entropy seems to be measuring more than just the "degeneracy of the ground state". 
The correct way to consider the boundary conditions, and hence the right definition 
of "degeneracy", was pointed out by Aizenman and Lieb [8]. At the same time, they 
provided a remarkable formula for the zero-temperature entropy purely in terms of 
zero-temperature concepts, with no reference to limits from finite temperatures. We 
shall take this formula as the definition. For a finite set A and an interaction $, let us 
denote Qf the set of restrictions to A of the ground-state configurations for $. 

Definition B.7 The zero-temperature entropy for an interaction $ is the limit 

A/^oo |A| 

In words, this formula says that a system has non-zero residual entropy iff the num- 
ber of distinct ground-state configurations, as viewed within a finite volume, grows 
exponentially with this volume. Following [244], it is suggestive to call such models 
super-degenerate . Intuitively, this feature requires the presence of competing interac- 
tions to produce a sufficient large number of ground-state configurations. Indeed, it 
can be proven [8] that all ferromagnetic models have zero residual entropy. 

The key result establishing the connection between non-zero residual entropy and 
existence of convivial ground-state configurations is the following. To abbreviate, for 
a translation-invariant (or periodic) measure fi we denote s(/i) = — z(/i|/i°) -|- log |0o|, 
where i{fi\i') is the relative entropy density defined in Section 2.6.2, and /i° is the 
product over all sites of normalized counting measure. (This the the physicists' usual 
entropy, which is defined relative to unnormalized counting measure on the single-spin 
space Oo — this accounts for the additive constant log |0o|.) 

Proposition B.8 Fix an interaction $. Then: 

(a) If fi is a translation-invariant w- zero-temperature measure for ^ , then s[fi) < s$. 

(b) There exists for $ a translation-invariant zero-temperature Gibbs measure fi such 
that s[fi) = s$. 

We summarize below the results on which this proposition is based (Proposition B.13 
and Theorems B.ll, B.15 and B.17 part (b)). We note that if the support of // is a finite 
set, then s(/i) = [// is of the form Y^i^i^tu,} hence s(/i) < — (1/|A|) Xli ^i log Aj- — )• 0]. 
Therefore, we conclude the following: 

Proposition B.9 A super-degenerate system, with finitely many rigid ground-state 
configurations exhibits infinitely many convivial ground-state configurations. 
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This proposition covers all the cases we know of in which the existence of convivial 
ground-state configurations has been proven. Consider, for example, the model with 
spins (jJi = —1,0, 1 and Hamiltonian 

H,= Y. [u^-u^y. (B.9) 

h-j|=i 

The ground-state configurations for this model are all the configurations with no spin 
equal to zero, and the all-"0" configuration. The zero-temperature entropy for this 
model is exactly log 2. As the all-"0" configuration is the only rigid one, we conclude, 
by the previous proposition, that there must be infinitely many convivial ground-state 
configurations. Another important example is the Ising model with nearest-neighbor 
antiferromagnetic coupling of strength J and magnetic field h = 2d\J\. Its ground-state 
configurations are those in which no two nearest-neighbor spins are simultaneously "— ", 
a fact that produces a non-zero residual entropy. There are no rigid configurations, 
hence the proposition implies the existence of many convivial ones. For this model, 
such a fact can be proven also by a different argument which yields some additional 
insight. Indeed, by identifying a "— " spin with the presence of a particle, the ensemble 
of ground-state configurations — with the associated conditional probabilities giving 
equal weight to all of them — is seen to correspond to the grand-canonical ensemble 
for the ideal lattice gas with nearest-neighbor exclusion and chemical potential equal 
to zero. Using a beautiful computer-assisted proof, Dobrushin, Kolafa and Shlosman 
[87] proved that such a system has an unique Gibbs measure. As none of the ground- 
state configurations are rigid, this Gibbs measure is non-deterministic and therefore 
supported on convivial configurations. This example shows a way (in fact, the only 
one we know of) to interpret and understand the characteristics of non-deterministic 
zero-temperature Gibbs measures supported on (very many) convivial ground-state 
configurations: One maps it into a statistical-mechanical problem for another, better 
understood, equivalent system. As the original ensemble involves configurations sat- 
isfying some condition derived from the minimal-energy requirement, this equivalent 
system will, in general, be a model with exclusions. That is, it will not fit into the 
general formalism developed in Chapter 2. 

Proposition B.9 does not yield any information on models with zero residual en- 
tropy, for instance on ferromagnetic systems. In particular, the question remains of 
whether conviviality requires super-degeneracy. A possible counterexample is presented 
in reference [95]: Consider, for the three-dimensional ferromagnetic Ising model, the 
ensemble of ground-state configurations that differ only locally (i.e. in finite volumes) 
from the "zig-zag interface" one: 



+ 1 if ti + t2 + ts > 

'(*l'*2.*3) ] _1 if ti+t2+t3 < . 



-afr:!, = , . T.Tr^n (B.io) 



Such an ensemble can be mapped onto an appropriate solid-on-solid model. If this 
model can be proven to have at least one Gibbs measure (a problem still open), then 
it would imply that the above configurations are convivial. We must acknowledge that 
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the standing conjecture [85, 95] is that such Gibbs measures do not exist. Note that 
if this Gibbs measure exists, then the all-"+" and all-"—" configurations would be at 
the same time rigid and in the (boundary of the) support of a highly non-deterministic 
zero-temperature Gibbs measure. 

Nevertheless, there is an interesting result (Proposition 4 of [8]) involving models 
with zero residual entropy: 

Proposition B.IO If s^ = 0, euerj/ translation-invariant w- zero-temperature measure 
for $ is supported on the set of rigid ground-state configurations. 

That is, if a model with zero residual entropy does in fact possess convivial ground-state 
configurations, then such configurations can lie in the support only of non-translation- 
invariant zero-temperature Gibbs measures. 

Remark. On the other hand, Radin [304] has shown examples of super-degenerate 
systems with a unique translation-invariant w-zero-temperature measure entirely sup- 
ported on the set of rigid ground-state configurations. In these examples, the set of 
ground-state configurations does not have any closed translation-invariant proper sub- 
set, hence it is formed by all the translates of a single (non-periodic) configuration, and 
limits of such. The non-zero residual entropy implies that any two such translates must 
differ in infinitely many sites, and hence they all must be rigid ground states. However, 
this phenomenon can happen only in the presence of infinite-range interactions (albeit 
decreasing arbitrarily fast with the range) [304]. 

B.2.5 Superfluous Ground-State Conflgurations 

The last type of ground-state configurations are those that are not in the support of 
any zero-temperature Gibbs measure. These are obviously the least interesting ones, 
and we shall call them superfluous ground-state configurations. The most immediate 
example is provided by the one-dimensional ferromagnetic Ising model. Its ground-state 
configurations are the all-"-|-", all-"—" and the fiat-interface configurations. However, 
all the zero-temperature Gibbs measures are of the form A(5_|_ -|- (I — A)(5_; the fiat- 
interface configurations (B.6) are superfiuous. Heuristically this is because the interface 
is free to wander at no energy cost; in the infinite- volume limit it wanders to iboo. The 

proof goes as follows: Denote by X-\ xo the indicator function of the configuration 

which is -|-I for x < Xq and —I for x > Xq. Then, for A = [—N, N], the measures (B.I) 
yield 



TT 



$,T=0/ 
A \ 



[ I/(2iV + 2) if T_(N+1) = +1, TN+1 = 

id -N<xo<N + l 
otherwise 



ctiil'-i- -i-T ^tt/|i ^j.1 I _i_ z" T^ 1 1 ^ 

B.ii , 



[The first line is due to the 2A^-|-2 possible positions Xq for the "kink" (lack of rigidity).] 
Therefore, if // is a zero-temperature Gibbs measure, then for every N > |a;o| we have 



, <^,T=Of I 

P\X+-xo) = /^ TTa (X+-^ol 
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-/i(cu_(jv+i) = +1 and un+i = -1) 



2N + 2' 

< i . (B.12) 

- 2N + 2 ^ ' 

Letting A^ — )• oo we conclude that fi{x-\ xo) = 0. The same holds, of course, for the 

flat-interface configurations which go from — to +. As the ground-state configurations 
here form a countable set (labelled by Xq G [— oo, oo] and the polarity of the kink), its 
measure is the sum of the measure of each of its points. Therefore, (B.f2) implies that 
fi gives full measure to the set formed only by the all-"-|-" and all-"—" configurations; 
the fiat-interface configurations are superfiuous. Combining this with the results stated 
above, we conclude that the fiat-interface configurations (B.6) are superfiuous in J = 1, 
and rigid in d > 2. 

The preceding argument requires not only that there be a growing degeneracy in 
the position of the interface, but also that the number of ground-state configurations 
be not too large, i.e. at most countable. This second fact is not true for higher dimen- 
sions. In dimension two, for instance, the "staircase-interface" configurations form an 
uncountable set. To be sure, the set of staircases with finitely many stairs is countable, 
and the above argument can be used to prove that this set has measure zero for all 
zero-temperature Gibbs measures. This would prove that the support of such mea- 
sures is always contained in the set formed by the all-"-|-", all-"—", fiat-interface and 
infinite-staircase configurations (this being a closed set whose complement has measure 
zero). But it does not rule out the occurrence of non-deterministic zero-temperature 
Gibbs measures supported on infinite-staircase configurations, similarly to what may 
happen in the three-dimensional Ising model for configurations close to cu^'^"^^^. Nev- 
ertheless, we must keep in mind that we are primarily interested in those features of 
zero-temperature phase diagrams that survive at (can tell us something about) low 
but nonzero temperature. Therefore, for the two-dimensional Ising model the possible 
existence of such a zero-temperature Gibbs measure with support on infinite-staircase 
configurations is a rather irrelevant issue, since it has been proven [1, 192] that only 
the Gibbs measures of the form X6^ -|- (1 — A)(5_ "survive" at non-zero temperatures. 
The question of non-deterministic Gibbs measures becomes really important only for 
dimension d > 3. 

B.2.6 Nonuniqueness of Specifications and Interactions 

We shall now comment on one important difference between the zero-temperature and 
nonzero-temperature formalisms: At zero temperature the "inverse problem" — given 
a measure, determine the specification and/or the interaction — is no longer well-posed: 
the map from interactions (or specifications) to zero-temperature Gibbs measures is, 
in general, many-to-one. This lack of uniqueness appears at three different levels: 

A) There are measures consistent with several different zero-temperature specifi- 
cations simultaneously. Theorem 2.15 does not apply at zero temperature because 
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there are (large) open sets having zero measure for all zero-temperature Gibbs mea- 
sures. Therefore, by redefining the specification more or less arbitrarily on such open 
sets we can obtain several different specifications for the same zero-temperature Gibbs 
measure. Let us present an explicit example. Consider the nearest-neighbor Ising 
model with formal Hamiltonian —JY,(xy)'^x'^y — hY.x'^x- Then the measure 6^ is a 
zero-temperature Gibbs measure for the following zero-temperature specifications: 

f . The specification 7r*i'^=° where $i is defined by J = and some h > 0. 

2. The specification 7r*2,T=o -y^j-^ere $2 is defined by some J > and h = 0. 

Nevertheless, n*i'"^~° 7^ j];^2,t=o ]3gcai;^sg ^j-^e former has S^ as its only zero-temperature 
Gibbs measure, while the latter has both 6^ and (5_ as zero-temperature Gibbs mea- 
sures. 

B) There are zero-temperature specifications which arise from several non-physically- 
equivalent interactions (in other words, the notion of physical equivalence becomes 
meaningless at T = 0). We give two examples: 

f. Trivial example: Consider any interaction $ and any number A > 0. Then $ 
and A$ are not (usually) physically equivalent, but they have the same zero- 
temperature specifications. 

2. Less trivial example: All Ising-type pair interactions (not necessarily ferromag- 
netic) such that hx > Y,y \Jxy\ for all x give rise to the same zero-temperature 
specification, namely the one that for each finite set A and every boundary con- 
dition gives, inside A, the measure concentrated in the all-"-|-" configuration. 

C) The variational principle (Section B.2.8 below) reduces to the minimization 
of the specific energy, which is not a strictly convex functional on B^ or any of its 
subspaces Bh- 

B.2.7 Stability and w-Stability 

Zero temperature is in itself unattainable. So one really is interested in those zero- 
temperature features that "survive" at low but nonzero temperatures. For instance, 
one is interested in determining which are the measures that can be obtained as a 
/3 — )• 00 limit of positive-temperature Gibbs measures for a fixed interaction $. We 
shall refer to these measures as stable measures for the interaction $. It is simple to 
check that all these stable measures must be zero-temperature Gibbs measures for $: 

Theorem B.ll Let fin be Gibbs measures for a fixed interaction $ and a sequence 
of inverse temperatures fin with fin — ^ -|-oo. // //„ — > //, then the measure fi is a 
zero-temperature Gibbs measure for $. 

However, not every zero-temperature Gibbs measure for a given potential is neces- 
sarily stable. We can illustrate this concept with the case of the Ising model. For J = I, 
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none of the deterministic zero-temperature Gibbs measures [6^ and (5_) are stable. In 
fact, the only stable measure is [6^ + <^-)/2. For the Ising model in dimension 2, only 
the measures of the form X6^ + (1 ~ A)(5_ are stable. The deterministic Gibbs measures 
associated with the rigid flat-interface configurations are unstable: at any nonzero tem- 
perature, the interface "wanders" to iboo and we are left with a convex combination 
of the "-|-" and "— " phases [I, 192]. For dimension 3, the fiat-interface measures were 
proven to be stable by Dobrushin [85] (see also [348]). Remark: The low-temperature 
Gibbs measure for the fiat-interface phase seems, at least in numerical experiments, 
to disappear at a temperature strictly below the critical temperature, giving rise to 
a roughening transition. If the above-mentioned zero-temperature measure supported 
near the configuration (jj^^s-^a.s happens to exist, we could ask two questions: (i) Does 
it survive for T > (stability)?; and, if so, (ii) Does it fail to survive to T = Tc"^. If 
the answer to both questions were yes, then the Ising model would exhibit a second 
roughening transition. 

However, as our eventual goal is the study of how the full phase diagram deforms 
as the temperature is raised, we must consider a more general situation in which the 
interaction is also varied as the temperature goes to zero. That is, we must con- 
sider the more general class of measures that can be obtained as a /3 — )• oo limit of 
positive-temperature Gibbs measures for interactions $„—)•$. We shall refer to such 
measures as w-stahle measures for the interaction $ [95]. [Warning: Reference [328] 
calls these measures stable.) In general, such measures need not he zero-temperature 
Gibbs measures for $. For example, if to the antiferromagnetic Ising model with field 
h = 2d\J\ considered above we add an additional field hn = a/jSm we obtain, in the 
limit /3„ — )• oo, a Gibbs measure corresponding to a lattice gas with chemical potential 
a. All these measures are different among themselves, and different from the unique 
zero-temperature Gibbs measure for the Ising antiferromagnet in a field h = 2J|J|, 
which corresponds to a = 0. A more dramatic example would be to add, to the same 
model, a field h = l/^/]3^. The measure obtained in the limit /3„ — )• oo would then 
be the measure S^ (all the conditional probabilities 7rJ~° are equal to (5_|_), which is 
not a zero-temperature Gibbs measure for $ because there is no rigid ground-state 
configuration for this model. This example shows that the notion of w-stability is per- 
haps a little too general; for interesting applications one usually constrains oneself to 
w-stability with respect to a pre-fixed set of perturbed interactions. In Section B.3.2 we 
shall make precise the desirable properties of such perturbations. 

At any rate, it is immediate that all w-stable measures have the weaker property of 
being supported on the ground-state configurations for the given interaction, i.e. they 
are weak zero-temperature measures: 

Theorem B.12 Let fin be Gibbs measures for a sequence of interactions $„ and a 
sequence of inverse temperatures fin such that $„—)•$ and fin — ^ -|-oo. // //„ — > //, 
then 

//({ground-state configurations for $}) = 1 , (B.13) 

i.e. fi is a weak zero-temperature mea.sure for $. 
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The notion of zero-temperature entropy involves a zero-temperature limit, hence 
it must have something to say about stability. Indeed, Aizenman and Lieb [8] have 
proven the following: 

Proposition B.13 If fi is a stable translation-invariant zero-temperature Gibbs mea- 
sure for $, then 

s(/i) = s$ . 

This result, together with Theorem B.ll and the fact that the set of translation- 
invariant Gibbs measures is non-empty at all temperatures, proves Proposition B.8 
(b) above. In the case of super-degenerate systems (i.e. systems for which s$ > 0), 
Proposition B.13 can be used to rule out the stability of some measures: 

Corollary B.14 For a super-degenerate system., every translation-invariant zero-temperature 
Gibbs measure supported on a finite set is unstable. 

For instance, for the system (B.9), the all-"0" Gibbs state is unstable. 

B.2.8 Variational-Principle Approach 

The variational-principle approach for zero-temperature measures was historically the 
first one to be considered [311]. At zero temperature it provides an even simpler crite- 
rion than at non-zero temperatures, because it reduces to a minimal-energy condition 
[F = E — TS reduces to F = F if T = 0). For a translation-invariant interaction $, 
it is not hard to show that the limit 

^^ - iT m l^o^ ^ ^^("^^ (^-^^^ 

A/oo |A| c^e^A ^ ^ 

exists; we call it the minimal specific energy (or ground-state energy). A translation- 
invariant measure // satisfying 

/i(/$) = e$ (B.15) 

is called a zero-temperature equilibrium. m.easure for the interaction $. The definition 
can be extended to periodic measures if /$ includes an average over all the sites of a 
basic period: If $ is invariant under a subgroup S* of Z , with Z / S isomorphic to a 
finite set P G I- , then one must define /$ = \P\~^ Y^xeP^X3x l^l~^ ^x- We shall 
assume this extension in the sequel. Schrader has proven [319]: 

Theorem B.15 

(a) Fvery translation-invariant w- zero-temperature mea.sure for ^ is a zero-temperature 
equilibrium. m.easure for ^ , i.e. satisfies (B.15). 

(b) Conversely, every zero-temperature equilibrium. m.easurefor^ is a w- zero-temperature 
mea.sure for $. 
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That is, for translation-invariant measures to be supported on (local) ground-state 
configurations is equivalent to having minimal average energy density. We notice that, 
unlike the finite-temperature case, we do not have an equivalence between the varia- 
tional and the Gibbsian-specifications approaches; only the more general w-measures 
appear in the previous theorem. The relationship between zero-temperature Gibbs 
measures and equilibrium measures is much more problematic. 

The variational approach yields also a characterization of periodic ground-state 
configurations: 

Theorem B.16 

1. For any periodic configuration io ^ ft, the specific energy (energy per site) 

ec^H = i^i^^^^H (B.f6) 

exists. 

2. The infimum of e^[(jj) over all periodic configurations u is finite and equals the 
value e$ defined in (B.I4). 

3. io is a periodic ground-state configuration if and only if e^[(jj) = e$ [328, 95]. 

For completeness, we mention also two variational principles involving the minimal 
energy density and the residual entropy: 

Theorem B.17 

(a) 



(b) [8] 



sq, = sup{s(/i) I /i G M+i^per(0, JT) and //(/$) = e$} (B.fSa) 

= sup{s(/i) \ fi is a w-zero temperature measure for $} . (B.fSb) 



In particular, (B.I8b) proves Proposition B.8(a). The "inf" in part (a) and the "sup" 
in part (b) are in fact "min" and "max", respectively. They are realized by the zero- 
temperature equilibrium measures. 
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B.2.9 Infinite Range and Lack of Quasilocality 

The variational-principle approach to zero-temperature classical lattice systems can 
be extended without difficulty to interactions in S° [319, 8]. The extension of the 
DLR approach to infinite-range interactions (e.g. in B^) is, however, more problematic. 
In particular, the validity of the important Theorem B.ll is an open question: A 
sequence of positive-temperature Gibbs measures for $ could conceivably converge 
to a limiting measure that is not consistent with the zero-temperature specification 
(B.l). If this latter specification were quasilocal, such a phenomenon could not occur 
[157, Theorem 4.17]; however, for long-range interactions the specification (B.l) is in 
general not quasilocal. Let us conclude this section with an example showing this lack 
of quasilocality. 

Consider any long-range one-dimensional Ising model with pair interactions J^y = 
J\x-y\ satisfying Xlnl'^ral < oo- The model has to be truly long-range in the sense 
that there must be infinitely many nonzero couplings J„; for simplicity of notation we 
assume that J„ 7^ for all n. We claim that the zero-temperature specification of such 
a model is non-quasilocal. Indeed, the zero-temperature conditional probability for the 
spin at the origin satisfies: 



TT 



{0} 



[^0 



+i|t1 



1 if J2x^o JxT^cy 

1/2 if Ex^oJ.rx = 

if J2x^0 JxTx < 



:B.19) 



To prove that this is not a quasilocal function of the boundary condition r, we need 
to show that there exists some e > for which the following is true: For an infinite 
sequence of nested finite sets A there exist two open sets of configurations. A/a and A'^, 
formed by configurations which are all identical inside A, but such that 
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$,T=0/ 
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+ 1|t' 



> e 
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if T G Aa and r' G A"^. Such sets Aa, A"^ are constructed as follows: Take A 
[-N, N] and fix A^o > A^ such that 



N 



7 , \Jx\ < Kjv-i-i| 7 

x>No 

and let Aa be the set of configurations r such that 



:B.2r 



+ 1 if 1 < X < A^ 

-1 if - A^ < X < -1 

sgn J^ if A^-M < |a;| < A^o 

anything if \x\ > Nq . 



:b.221 
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The set set A/"^ is defined analogously but replacing sgn J^^ by — sgn J^. We then have: 



/ J •Jx'Tx — / J •Jx'Tx — \ 
xi^O \x\>N 



No 

2 Yl \JA + '^ J2 J^^^ > for T G A/'a 

|:e|=JV+1 \x\>No 



No 

-2 Y. 1-^-1 + 2 E J-^- < forrGA/"; 

|:e|=JV+1 \x\>No 



where the last inequalities follow from (B.2f). Therefore, by (B.19), 



A 5 

B.23) 



7rJ;;[=°(cuo = +1|t) - 7rJ;;[=°(cuo = +1|t')| = 1 (B.24) 

if T G A/a and r' G A/'^, and the specification is not quasilocal. 

B.3 Phase Diagrams 

B.3.1 Regular Phase Diagrams 

The words "phase diagram" are usually associated with nice pictures in which two 
conditions are satisfied: 

1) Only periodic extremal Gibbs measures are considered. We emphasize that 
the order of the qualifiers has been carefully chosen: the measures relevant here are 
those extremal Gibbs measures that happen to be periodic; we are not referring to 
the measures that are extremal among the periodic ones (this latter is a larger and 
less-well-behaved class). For short, we shall call these measures pure phases^ but we 
emphasize that this embodies a double change with respect to the terminology adopted 
in the rest of this paper: First, we consider all periodic Gibbs measures on the same 
footing, whether they are invariant under the whole translation group Z or merely a 
nontrivial J-dimensional subgroup of it. Second, we invert the order of the qualifiers, 
that is, we call pure phase an extremal measure in the sense of (ii) in Section 2.4.9, 
rather than in the more customary sense (iii). 

We shall fulfill this condition throughout the rest of this appendix: by "phase 
diagram" we will mean the partition of a certain parameter space into regions with a 
given number and type of pure phases. 

2) The Gibbs phase rule [363] is obeyed. Let us explain in a little more detail 
what this means. An example of a phase diagram satisfying the Gibbs phase rule is 
presented in Figure 13 below: There is a point where three pure phases coexist [point 
of maximal coexistence)^ from which there emanate three lines where two pure phases 
coexist, which in turn bound three open regions in which there is only one periodic 
extremal Gibbs measure. Such a phase diagram will be called regular. More generally, 
an r-regular phase diagram, consists of [170, Appendix A]: 

(1) a point of maximal coexistence where r pure phases coexist; 
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(2) r one-dimensional open manifolds, each bounded by this maximal-coexistence 
point, where exactly r — 1 phases coexist; 

(3) r[r — 1)/2 two-dimensional open manifolds, each bounded by pairs of the previous 
one-dimensional manifolds, where exactly r — 2 pure phases coexist; 



(r) r open (r — l)-dimensional manifolds, each bounded by the (r — 2)-dimensional 
2-phase-coexistence manifolds, and such that the closure of their union is the 
whole parameter space, where there is only one pure phase. 

Usually, the pure phases are defined by fixing the boundary conditions according to 
some parameter-independent set /C of reference configurations (or, more generally, mea- 
sures). Typically, /C is the set of ground-state configurations at the point of max- 
imal coexistence at T = 0. One can then label each pure phase according to the 
boundary condition employed in its definition. One calls the K-stratum [K C /C) 
the manifold in parameter space where the coexisting phases are precisely those la- 
belled by elements of K . For instance, in Figure 13, the different strata are labelled 
by the boundary conditions "-|-", "0" and "— ". There are, therefore, seven strata: 
{ + }, {0},{-},{ + ,0}, { + ,-}, {0, -}, { + ,0, -}.^ ^ 

A more abstract (topological) way of visualizing such a phase diagram is provided 
by the following equivalent characterization: a r-regular phase diagram is a diagram 
that can be homeomorphically mapped onto the boundary of the positive octant in r 
dimensions, 

dQ, = |(ti,...,t,) G R>o: mint, = 0|, (B.25) 

in such a way that the point of maximal coexistence corresponds to the origin, the 
curves of (r — l)-phase coexistence correspond to the positive coordinate axes excluding 
the origin, . . . , the open sets with only one pure phase correspond to the (r — 1)- 
dimensional coordinate hyperplanes excluding their (r — 2)-dimensional boundaries. In 
brief, the different strata are mapped into the different submanifolds of the boundary 
of the r-octant. 

General phase diagrams need not obey the Gibbs phase rule. A typical situation is 
for some of the pure phases to always appear together throughout the diagram. Such 
a situation is called a degeneracy^ and it is usually associated to some symmetry of 
the system (if no symmetry can explain it, the degeneracy is called fortuitous). The 
addition of further interactions (not respecting the symmetry) can produce a phase 
diagram without degeneracy. These extra interactions are said to break the degeneracy 
of the pure phases in question. An interaction is said to completely break the degeneracy 
of the pure phases if its addition yields a regular phase diagram. 

B.3.2 Zero-Temperature Regular Phase Diagrams 

For zero-temperature phase diagrams, it is relatively simple to give conditions on the 
extra interactions needed to ensure a regular phase diagram. Indeed, at zero tem- 
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perature degeneracy means equal specific energy for all values of the parameters, and 
its breaking involves adding interactions producing a different set of specific energies 
for each of the initially degenerate pure phases. This is usually done perturbatively, 
that is, each additional interaction is multiplied by an overall "turn-on" parameter. 
Suppose we start with an interaction $o having r degenerate zero-temperature pure 
phases /ii, . . . /i^. Then, to completely break the degeneracy one usually considers r — 1 
additional interactions $i, . . . , $r-i and constructs the "perturbed" interactions 

8 = 1 

[Examples: (i) For the Ising model at zero field, Ai = h; (ii) for the Blume-Capel 
interaction defined by (B.34) below, Xi = g and X2 = h in the "perturbation" (B.35).] 
The parameters A = (Ai, . . . , A^-i) usually take values in a certain neighborhood of 
the origin. The degree of degeneracy for the perturbed interaction $- depends on the 
r-tuple of specific energies 

e(A) = (e<3>_(/ii), . . . , e<3>_(/i^)) . (B.27) 

In fact, if we denote 

Q{X) = {i : fii minimizes e$_(/i)} . (B.28) 

then the strata of the zero-temperature phase diagram are the sets 

Sk = {X:Q(X) = K} (B.29) 

for each subset of labels K C {1, . . . , r}. The perturbed interaction completely breaks 
the degeneracy if the phase diagram formed by the strata (B.28) is r-regular. 

It is of interest to translate the requirement of regularity into conditions on the 
perturbations $i. One way to do it is to notice that, as the specific energy depends 
linearly on the parameters Aj-, it can be written in the form 

e(A) = X]A,e($0, (B.30) 

8 = 1 

with 

e($,) = (e$,(/ii),...,e$,(/i.)). (B.31) 

One of the conditions for the phase diagram to be r-regular is that the origin A = 
be the only point of maximal coexistence. This implies that no nonzero vector of the 
form (B.30) can have all its coordinates equal. A little bit of linear algebra shows that 
all the other conditions for regularity are satisfied if the vectors {e($j)}i<j<r-i are, 
in addition, linearly independent. Therefore, the perturbations $1, . . . $r-i completely 
break the degeneracy o/$o if and only if the vectors e[^i) are linearly independent and 
they do not span the vector (1, . . . , 1) G R*^. 

Alternatively, if we resort to the previous geometrical description of regularity, we 
conclude that it is equivalent to require that the vector e(A) — shifted so it always has 
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at least one coordinate equal to zero — sweeps over the boundary dQr of the positive 
octant. Precisely stated, if we denote 

e-(/^0 = e$-(/iO - min e$ (/ij) , (B.32) 

A l<j<r ^ 

the perturbation $- completely breaks the degeneracy of $o if 'ind only if the map 

A^ (e-(/ii),...e-(/i,)) (B.33) 

is one-to-one. In other words, if such a map is a bijection from, a neighborhood of 
G R ~ to a neighborhood o/ G dQr- For each particular value of A, the coexisting 
pure phases are those fii with e-[fii) = 0. 

B.3.3 Low- Temperature Phase Diagrams. Scope of Pirogov-Sinai Theory 

If nature is fair, one expects that low-temperature phase diagrams look very similar 
to the corresponding zero-temperature ones. This is not always so, however, and the 
question of stability or w-stability of Gibbs measures is an important issue. Pirogov- 
Sinai theory has been precisely designed to single out some important cases in which 
indeed the low-temperature diagrams are only a small deformation of the ones at zero 
temperature. When the theory applies, one is guaranteed that the regularity of the 
diagram is preserved at least for small temperatures; and, furthermore, that the low- 
temperature pure phases look very "similar" to the zero-temperature ones. 

As an input to the Pirogov-Sinai theory one must determine the zero-temperature 
phase diagram and show that two key hypotheses are satisfied. The first hypothesis 
refers to the number of zero-temperature deterministic pure phases: In its original 
version [296, 297], Pirogov-Sinai (PS) theory applies to a system with a finite-range 
periodic interaction, exhibiting a finite number of periodic rigid ground-state configu- 
rations. (This has subsequently been generalized to some extent: see Section B.4.4.) 
The second hypothesis is that the interaction satisfy the so-called "Peierls condition", 
to be stated more precisely below, which roughly requires that for each rigid periodic 
ground-state configuration the energy cost of introducing a droplet of spins aligned 
as in a different ground state must grow typically as the area of the boundary of the 
droplet. This condition allows the energy cost of creating excitations to beat the en- 
tropy gain, preserving the long-range order observed at zero temperature. However, 
the Peierls condition has this desired effect only for d > 2. The trouble is that for d = 1 
the size of the boundary of a set does not grow with its volume. Therefore, Pirogov- 
Sinai theory is not applicable to one-dimen.sional models. On the other hand, for d > 2 
the Peierls condition is certainly stronger than necessary^^: there exist models with 
a finite number of rigid periodic ground-state configurations which have a non-trivial 
phase diagram and which do not satisfy the Peierls condition [290, 264]. Nevertheless, 



''in some sense it is the strongest possible condition: see the comments after Definition B.19 below. 
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the Peierls condition applies in a large number of interesting models, and allows a very 
precise description of the low-temperature behavior. 

The output of the theory is a family of results involving extensions to non-zero 
temperatures. The main result of the theory is that for a system satisfying the Peierls 
condition the phase diagram involving these periodic deterministic measures is stable: 
As the temperature increases, the coexistence manifolds deform continuously (in fact 
analytically). Moreover, the theory makes rigorous the intuitive picture of what each 
low-temperature pure phase looks like: its typical configurations consist of a "sea" of 
spins aligned as in the ground-state configuration with small and sparse "islands" of 
overturned spins. 

We remark that the theory does not have anything to say about the stability of 
the (possibly infinitely many) non-periodic ground-state configurations and the zero- 
temperature Gibbs measures they support (but see Section B.4.4). Other techniques 
are needed to show, for example, that the fiat-interface ground-state configurations 
(B.6) — which are rigid for d > 2 — are unstable for d = 2 [141, 1, 192] and stable for 
d>3 [85, 348]. 

Moreover, the original version of PS theory gives only very limited information as to 
the specifics of the deformation of the phase diagram; in particular it does not produce 
a useful criterion to determine which pure phases are stable for the different regions 
of the zero-temperature phase diagram. That is, it does not tell us in which direction 
the phase boundaries move when the temperature is raised from zero. Therefore, 
for interactions $ lying on a phase-transition manifold of the zero-temperature phase 
diagram, the original PS theory does not tell us in which phase(s) $ ends up at T > 
0; that is, it does not tell us which one(s) of the coexisting zero-temperature pure 
phases is/are stable, and which are only w-stable for the family of interactions adopted. 
However, Slawny's extension of PS theory [329] provides this additional information. 

To clarify this point, let us borrow a very instructive example from the review by 
Slawny [329]. Consider the spin-1 Blume-Capel model defined by the formal Hamilto- 
nian 

Ho = lJ2i^^-^yf ^ (B-34) 

{xy) 

where uj^ = —1,0, 1 and the sum is over pairs of nearest-neighbor sites in Z , J > 1. 
Such a model has three periodic (in fact translation-invariant) ground-state configu- 
rations: all-"-|-", all-"0" and all-"—". They are all rigid. To obtain a 3-regular phase 
diagram one can consider, for instance, the family of interactions defined by the formal 
Hamiltonians 

H{g,h) = Ho-gJ2^l-hJ2^^- (B-35) 

X X 

The corresponding zero-temperature phase diagram is presented in Figure 13(a). Pirogov- 
Sinai theory tells us that for T > low enough the phase diagram is just a continuous 
deformation of the one depicted, but to conclude that such deformations look as in 
Figure 13(b) we need some extra information which is not directly obtainable from PS 
theory, although it is probably contained in it. This extra information is presented ex- 
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plicitly, for instance, in Slawny's theory of asymptotics of phase diagrams [329]. From 
the latter diagram we see, for instance, that of the three deterministic pure phases of 
H[g = 0, /i = 0) only the all-"0" is stable, while the other two pure phases are w-stable. 
The well-studied ferromagnetic Ising model provides an example of an exceptional na- 
ture: its phase diagram remains undeformed at low temperatures; for all values of the 
magnetic field the periodic zero-temperature Gibbs measures are stable. 

B.4 Pirogov-Sinai Theory 

We summarize now the main aspects of PS theory. In the first two subsections we 
carefully discuss the basic hypotheses required by the theory; in the third subsection 
we present a somewhat detailed account of the results (for finite-range interactions). 
Of course, we omit all proofs; these can be found in the references cited. As already 
pointed out, the theory does not apply for J = 1, therefore in the rest of this appendix 
we restrict ourselves to d > 2. 

B.4.1 Boundary of a Configuration. The Peierls Condition 

Typical configurations of a low-temperature pure phase are expected to be small fiuc- 
tuations around those of a corresponding zero-temperature pure phase. These fiuctua- 
tions result in the appearance of "droplets" ("bubbles", "islands") of spins aligned ac- 
cording to a different zero-temperature pure phase — or, more generally, a "metastable 
phase" [369] as we discuss below. These droplets are surrounded by a transitional re- 
gion or "boundary" of sets of spins not aligned according to any zero-temperature pure 
phase, which therefore raises the energy of the configuration. The probability of such 
fiuctuations is determined by the competition between two factors: the energy cost 
of introducing a boundary and the entropy gain due to the different possible shapes 
and locations of the droplets. If the energy cost is large enough to overcome, at low 
temperatures, the entropy gain, then each zero-temperature pure phase gives rise to 
a low-temperature one which differs only in the presence of few and small droplets 
of overturned spins. In particular, this would prove that there are precisely as many 
coexisting pure phases at low temperatures as there are at zero temperature, and 
hence that there is a phase transition. This type of argument was first introduced by 
Peierls [291, 82, 168] to prove the existence of a phase transition in the J-dimensional 
Ising model for d > 2^ and hence it is often referred to as the "Peierls argument". 
Pirogov-Sinai theory is a generalization (and, thus, a more abstract version) of such 
an argument. 

To formulate the Peierls argument in a rigorous form we need a criterion to deter- 
mine when the energy cost of a boundary is "large enough" to defeat the entropy gain. 
The Peierls condition is precisely one such (sufficient) criterion. It relies, however, on a 
suitable definition of the "boundary" of a configuration, which is not a uniquely defined 
concept. In fact, two complementary notions are introduced at this stage: the bound- 
ary^ which roughly corresponds to the collection of sites where the spins are misaligned, 
and the contours^ which are the different components of this boundary together with 
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Figure 13: Phase diagrams of the model with interaction (B.35). (a) Zero temperature, 
(b) Low temperature. 
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the corresponding spin configurations. The latter allow a complete determination of 
the energy of a given configuration. 

Let us motivate the general definitions via examples. In the original case of the fer- 
romagnetic Ising model, the boundary of a configuration can, for instance, be defined as 
all the pairs of nearest-neighbor sites with opposite spins. As each such pair contributes 
equally to the energy of the configuration, regardless of which spin of the pair is up and 
which is down, one does not need to specify the actual configuration on the boundary 
to compute the energy. Therefore contours are defined with no reference to configura- 
tions, by considering the polyhedral surface formed by plaquettes perpendicular to the 
bonds joining misaligned spins, and taking its connected components [82, 168]. The 
same definition of boundary works for the Blume-Capel model (B.34), but to compute 
the energy we now must specify the configuration of each pair of misaligned spins, 
as different combinations have different energies. The definition of contours requires 
hence to consider polyhedra labeled by the configuration on the immediately adjacent 
(internal and external) shells of spins. The next complication appears for models with 
interactions extending beyond nearest neighbors and/or involving more than two spins 
at a time. An example of practical interest is the antiferromagnet on a face-centered 
cubic lattice [329, p. 145 and references therein]. Such models require "thicker" bound- 
aries and contours defined specifying the configurations of larger groups of spins. 

Therefore, to define boundary and contours in a general fashion, we must check 
whether sets of spins are aligned or misaligned, but this checking has to be done on 
sufficiently large collections of spins at a time. Following closely [328, Chapter II], we 
consider a set K, = {cu^^\ . . . ,i!l^^'} of periodic configurations (r > 1). For the standard 
statement of the Peierls condition /C will be the set of periodic (deterministic) ground- 
state configurations of some interaction, but the definition can be done (and must be 
done, as we shall discuss in next section), for general sets of periodic configurations. 
Let us call /C the set of reference configurations [369]. We also consider for some fixed 
a > the cubes Wa{x) = {j/ G Z : , |j/i — Xi\ < a for 1 < i < d} — the sampling cubes. 

Definition B.18 The boundary of a configuration u — with respect to the set of ref- 
erence configurations K, and sampling cubes Wa{x) — is the set of sites 

dto = [j iyWa{x): to\wa{x) ^ ^kaW ytsl^K^] ■ (B.36) 

Typically we will consider configurations u equal to some cu G /C except for a finite set 
of spins. In this situation the boundary is a finite set. 

Let us now state the simplest and most popular version of Peierls condition; in the 
following section we discuss a more general definition. We consider an interaction $o 
and, for each fixed cu G /C construct the relative Hamiltonian 

H^'{uj\ui)= Y. [^oA(tu) - $oA(tu)] , (B.37) 

A-.ACI'^ finite 

defined only for configurations u coinciding with cu except on a finite set. Let us denote 
^T=o(^o) the set of periodic ground-state configurations of $o- 

220 



Definition B.19 The interaction $o satisfies the (original) Peierls condition if there 
exists a constant /Jq > such that for each uj_ G ^x=o(^o) 



a '{^\'-2) > Po\dio\ (B.38) 

for every configuration u coinciding with uj_ except possibly on a finite set of sites. Here 
did is the boundary of io with respect to K, = ^x=o(^o) 'ind sampling cubes defined by 
some fixed choice of a > 0. 

We shall call a constant po satisfying (B.38) a Peierls constant for the interaction $o 
(and the chosen /C and a). The Peierls condition immediately implies that each periodic 
ground-state configuration is rigid, and hence defines a deterministic zero-temperature 
pure phase (Section B.2.3). The converse is not true [290, 264]. We also notice that 
an upper bound of the form H^°{io\i!i) < po\d(jj\ is always true, hence the Peierls 
condition is basically a requirement for the energy cost to grow as fast as possible 
with the size of the boundary of the configuration. There are important models where 
this is not true, i.e. in which the energy cost grows more slowly than the area of the 
boundary: for example, the balanced model [132, 48] and the ANNNI model [48, and 
references therein]). 

We notice that the validity of the Peierls condition does not depend on the particular 
choice of the parameter a > adopted for the definition of the boundary, but the 
actual value of the Peierls constant does. Indeed, following [328] we notice that if d'co 
indicates the boundary defined via sampling cubes Wa' with a' > a (other cases left to 
the reader), then 

duj C d'uj C U^ed^Waix) (B.39) 

thus, 

\duj\ < \d'uj\ < {2a' + lY\duj\. (B.40) 

Therefore, different choices of a change the actual value of po: 

< Pq = posup—— < Po , (1^-41) 



(2a' + 1)^^ - "^ "" J \d'io\ 

but not its nonzero character. One has the freedom of adjusting a according to future 
convenience. However, the actual value of po is related to the range of temperatures 
where PS theory is valid (this range is proportional to po). Hence, for the sake of quan- 
titative predictions one should employ a value of po as large as possible, which means 
a as small as possible. An extremely favorable case is exemplified by the ferromagnetic 
Ising model, and its generalizations to higher spins, for which the boundary of config- 
urations can be defined via polyhedra of "zero width" [and we may even have equality 
in (B.38)]. Strictly speaking, the corresponding "zero- width" (or thin) contours are 
not included in the formalism to be introduced below, but we shall keep them within 
our discussion through appropriate comments. 

The actual verification of the Peierls condition is a model-dependent, generally 
nontrivial, procedure. The starting point is, in principle, the determination of all 
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periodic ground-state configurations — an often tedious process. A siiglit simpiification 
foiiows from tlie observation tliat if we find that (B.38) is satisfied for some finite set 
K, of periodic configurations, automatically these must he all the periodic ground states. 
Indeed, (B.38) implies tliat sucli configurations cu are ground states, and if tliere were 
otliers (B.38) would not be satisfied because arbitrarily large boundaries could be 
constructed without extra energy cost, simply by interposing regions occupied by the 
ground states not accounted for. In practice, this observation is of little help, as the 
determination of ground states is made using some sort of contour ideas, so checking the 
Peierls condition and finding the ground-state configurations are almost simultaneous 
processes (however, see [215]). The only real shortcut available is a sufficient condition 
due to Holsztynski and Slawny [195] which we will use for almost all the applications 
in this paper. 

Definition B.20 A potential $ is an ?7i-potential if there exists a configuration u 
simultaneously minimizing each \ A\ -body function : 

$^(cu) = mm<!>A{Oj) WAeS. (B.42) 

For such an interaction let us denote by CJy_o($) the (nonempty) set of configurations 
minimizing all $^. The sufficient condition is: 

Tlieorem B.21 (Holsztynski-Slawny) A finite-range m-potential $ with Qx=o{^) 
finite satisfies the Peierls condition. 

Resorting to an alternative — and suggestive — terminology, we can say that an 
?7i-potential is one for which there are ground states "satisfying" all bonds. An im- 
mediate example is any Ising model with ferromagnetic interactions ($^ = — J^cr^ 
with Ja > for all A): clearly the all-"-|-" configuration simultaneously minimizes all 
$^. The opposite case is that of the potentials with "frustration", i.e. for which every 
configuration has bonds that give an energy contribution larger than the minimum 
possible ("frustrated bonds"). However, these notions of ?7i-potentials and "frustra- 
tion" must be taken modulo physical equivalence, because equivalent potentials have 
the same statistical-mechanical properties. This adds an extra twist to the matter. A 
popular example is the antiferromagnetic Ising model in a triangular lattice. It is easy 
to see that when the model is given its usual formulation in terms of two-spin inter- 
actions, no configuration can "satisfy" simultaneously the three bonds of a triangular 
plaquette. But this seemingly frustrated potential can equivalently be written by con- 
sidering the triangular plaquettes themselves as the bonds, with an energy contribution 
obtained by a suitable combination of the contributions of the original two-spin bonds 
around the plaquette. In this formulation the model is now an ?7i-potential (although 
one cannot use Theorem B.21 because there are infinitely many periodic ground-state 
configurations). In this regard, probably the most difficult aspect of the application of 
this very convenient theorem is the verification of whether the potential of interest can 
be rewritten as (i.e. is physically equivalent to) an ?7i-potential. It would be very nice 
to complement Theorem B.21 with some simple sufficient criterion for an interaction to 
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be physically equivalent to an ?7i-potential, but this may not be an easy task. For in- 
stance, the natural conjecture that every finite-range translation-invariant interaction 
is equivalent to a (translation-invariant finite-range) ?7i-potential is false [263]. 

At any rate, once the ?7i-potential character has been verified. Theorem B.21 is an 
extremely convenient tool. It has, however, an important drawback: its proof is not 
constructive, so it does not provide any explicit expression for the Peierls constant. 
Therefore, arguments based on this theorem do not allow any determination of the 
range of temperatures where the PS theory remains valid. 

B.4.2 Contours. The Generalized Peierls Condition 

In the presence of the Peierls condition for an interaction $o, the usual Peierls argument 
can be repeated for those zero-temperature pure phases of $o for which the entropy 
factor can be shown to grow at most exponentially with the size of the boundary. 
Indeed, the Peierls condition ensures that the energy cost grows as least as fast but 
with an exponent including a factor /3, hence the energy cost beats the entropy gain 
for large enough /3, and only small boundaries are present. However, this energy-beats- 
entropy phenomenon is in general not true for all the pure phases, only for the stable 
ones. It turns out that to obtain a situation in which the entropy is beaten by the 
energy for all the rigid periodic ground-state configurations of $o — and hence all of 
them coexist — one must consider a perturbed interaction $ = $o + $ for a suitably 
adjusted $ (shift in the point of maximal coexistence). In general, not all the ground- 
state configurations for $o are ground-state configurations for $, hence this process 
of "tuning" $ requires us to consider a set /C not reduced just to configurations with 
minimal $-energy. 

Another reason to generalize the Peierls condition appears when studying whole 
regions of the phase diagram. In such a situation one is interested in estimates valid 
uniformly throughout the region; but a uniform Peierls condition, as stated in Defi- 
nition B.19, is not in general possible. For example, consider the Ising model in the 
presence of a strictly positive magnetic field. The only ground-state configuration is 
the all-"-|-" — to be denoted uj^~^> — and hence \d(jj\ is proportional to the number of 
"— " present. For instance, for the configurations uj equal to -|-1 everywhere except 
inside a cube W, the relative energy is H (uj^ \uj^~^>) = 2J\dW\ -\- 2hvol{W), while 
|9cu^| ~ vol(Vl/^). A simple calculation shows that for the Peierls condition to be valid 
for all these uj^ we need p < h. Thus the (original) Peierls condition is not satisfied 
uniformly in a neighborhood of the point /i = 0, which is precisely the most interesting 
region. 

A generalized Peierls condition must, therefore, allow configurations that are not 
necessarily ground states and also must satisfy some "uniformity" requirement. Such a 
condition is already contained in the work by Pirogov and Sinai, where the main results 
are shown to be consequence of a further generalized condition for the perturbed $ that 
follows from the the Peierls condition satisfied by $o- Zahradnik [366] was, however, 
the first to point out that this generalized condition is a more natural starting point 
from the conceptual point of view. We initially had a more concrete motivation: the 
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uniformity requirement is important for our example of the Kadanoff transformation 
(Section 4.3.3). In fact, this apphcation demands only a particular case of uniformity 
(Corollary B.25 below), and, moreover, the result we need is exactly given by a theorem 
due to Zahradnik (Theorem B.30 below). However, we shall take here the time to 
discuss the uniformity issue in some generality, because we feel that it has not been 
sufficiently emphasized in the literature. 

Let us first introduce the notion of contour. We fix a set K, of reference configura- 
tions and a choice of sampling cubes (value of a). The idea is to decompose the bound- 
ary in components: two sets A and B of sites are called connected if dist (A, B) < 1 
in lattice units. A contour of a configuration cu is a pair V = [M^lom) where Af is a 
maximally connected component of the boundary of io. The set M is often called the 
support of the contour F. At this point we start introducing constraints on the size of 
the sampling cubes. We require: 

(CI) The value 2a -\- 1 must be strictly larger than all the periods of the reference 
configurations cu G /C. 

Such a requirement implies the following extension property (nomenclature taken from 
[329]): if a configuration io coincides with a reference configuration cu G /C on the 
sampling cube Wa{x) and with cu' G /C on the cube Wa{y) with dist(a;,j/) < 1, then 
cu = cu'. This has the key consequence that we can reconstruct uniquely a configuration 
id starting from its family of contours. 

Each contour with a finite support divides ~L \ M into several disconnected com- 
ponents: One of them is unbounded, and is called the exterior of the contour; the 
others are bounded and are collectively called the interior of the contour. Each of 
these components has a reference configuration associated to it, namely that of the 
sampling cubes centered on sites adjacent to the support of the contour. The contour 
is a ui- contour if its exterior corresponds to the reference configuration cu. On the other 
hand, the cu^'^-interior — denoted Int^(,) (F) — is the union of the components of the 
interior of F associated to a reference configuration u}^'. In general, io_ will have other 
contours besides F, some of which may be in the interior of F. Hence cu may not coincide 
with io}^' on the whole Int^(,) The generalized Peierls condition is a requirement on the 
minimum energy cost of introducing a contour. This can be estimated by considering 
the configuration to^ that has F as its only contour. If F = (Af, cum) is a cu-contour, to^ 
coincides with io_ on the exterior of F, with u}^' on the whole Int^(,), and with lom on 
the support M 

Let us introduce now a periodic interaction $. The energy cost of the cu-contour F 
is given by the relative energy i7*(c<;'"|cu), which can be decomposed in the form: 

H^iuJ^U) = *(F) + J2[e<^{^^'^) - e$(cu)]|Int^(o| . (B.43) 

8 = 1 

The second term in the RHS is, up to terms proportional to |9Af|, the energy contri- 
bution due to the configurations in the interior of F. This term is absent if all the u}^' 
are ground-state configurations of $. The contour functional ^(F) is defined by the 
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identity (B.43); it is roughly equal to J2acm[^ Ai'^^) — $a(^)], but it also includes the 
just mentioned terms proportional to |9Af|. 

Definition B.22 An interaction $ satisfies the generalized Peierls condition — with 
respect to a set K, of reference configurations — if there exists a constant p > such 
that for each contour T = (Af, cum)- 

*(r) > p\M\ (B.44) 

The (original) Peierls condition (B.38) corresponds to the particular case in which 
JC = ^y=o(^)- W^ remark that this generalized condition is sometimes called just 
"Peierls condition", or "Gerzik-Pirogov-Sinai" condition. We shall also call a Peierls 
constant — for the interaction $ — a constant p satisfying (B.44). 

To understand why Definition B.22 has the desired uniformity, let us return to 
the example of the Ising model with magnetic field h > 0. We must now consider 
/C = {uj^~^\uj^~>}, where uj^~^> and uj^~> are the all-"+" and all-"—" configurations 
respectively. We notice, however, that uj^~> is not a ground state. With this choice of 
/C, the contours can be taken to be "thin" as in the zero-field case, and we have that 
for any cu ("'")- contour F 

H'^{uj^\J+y) = 2J|aW/| + 2/i|Int^(-)(r)| , (B.45) 

while for an cu^") -contour 

H'^{u^\J-^) = 2J|aW/|-2/i|Int^(+)(r)| . (B.46) 

So, comparing with (B.43) we see that the generalized Peierls condition is satisfied 
with p = 2J, uniformly in h. We see that this uniformity is gained by including the 
extra configuration uj^~' which is not a ground state, but rather could be interpreted 
as a "metastable state". 

In general, the uniformity property of the generalized Peierls condition is a conse- 
quence of an estimate valid for sampling cubes larger than the period and range of the 
interaction; that is, we impose the following extra condition on the sampling cubes: 

(C2) The value 2a -|- 1 must be strictly larger than the period and the range of the 
interaction $. 



We emphasize that due to requirements (CI) and (C2), the value chosen for a — that is, 
the definition of the contours — depends on the set /C and on the interaction(s) present. 
The reader should keep this in mind especially because, to keep formulas simple to 
read, the notation will not make this dependence explicit. In particular, a change in 
the interaction — for instance the addition of an arbitrarily small perturbation — will 
require the redefinition of the contours. 
Under condition (C2), 

J2 $^(cur)-|Int^(o|e$(cuW) 

A:An Int^(,) U9extlnt^(,) ^0 

< 2(2a + l)'^||$||Bo|aextInt^(o| (B.47) 
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which imphes the following key estimate. If F = (Af, cum) is a a cu-contour, then for 
any periodic interaction $ 

H^iio^U) - J2 |Int^(0|[e$(^^''^) - e$(^)]| < 2(2a + l)i$||,5o|M| . (B.48) 

8 = 1 

Therefore: 

Theorem B.23 (Uniformity property) // a periodic interaction $o satisfies the 
generalized Peierls condition with constant p, then for any interaction $ with \\^\\b° ^ 
cp/{2a + 1) , the sum $o + $ satisfies the generalized Peierls condition with constant 
p{l-2c). 

Another useful result, which basically follows from (B.48), is the following [328, Lemma 
2.2]: 

Proposition B.24 Consider a periodic interaction $o satisfying the original Peierls 
condition (B.38) with constant po. Then, for any other periodic interaction $ 

||$||go < p/{2a + ly =^ e^Pl^o($o + $) C e^rJo(^o) . (B.49) 

We present two corollaries of Theorem B.23. For our study of the Kadanoff transfor- 
mation we need the following trivial consequence: 

Corollary B.25 Consider a periodic interaction $o satisfying the original Peierls 
condition (B.38) with constant po, and another interaction $ such that ^x=o(^o) = 
^T=o(^o + $)• Then if \\^\\b° ^ cpo/{2a + 1)'', the sum $o + $ satisfies the original 
Peierls condition with constant po[l — 2c). 

However, the corollary more often used is: 

Corollary B.26 // $o satisfies the original Peierls condition with constant po [and 
JC = ^x=o(^o)7; then a "perturbed" interaction $ = $o + $ satisfies the generalized 
Peierls condition with constant po{l — 2c) [and the same ICJ if \\^\\b° ^ cpo/{2a + 1)''. 

This corollary generalizes what was observed regarding the Ising model in non-zero 
field. 

As the inclusion in (B.49) is in general strict, the last corollary implies that, from the 
point of view of$ = $o + $, the uniformity is gained at the cost of including some extra 
reference configurations that are not ground states (e.g. uj^~> in the above example). 
These extra configurations can be interpreted as "metastable states" or "local ground 
states" for $ [369]. On the other hand, any system with a finite number of periodic 
ground-state configurations ought to satisfy the generalized Peierls condition if one 
adds all the local ground states of the model [369] (or allow more complicated types of 
reference states). 

At the risk of being considered almost patronizing, we emphasize again that the 
size a in the previous results is chosen so as to satisfy (CI) and (C2) for the total 
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interaction $o + $• Often, $o is a simpler or more standard interaction that one 
studies independently or for which one can borrow results from the literature. The 
Peierls constant determined in this manner corresponds, hence, to a values of a chosen 
without reference to anything but $o- When considering in addition perturbations $, 
no matter how small, this size may need to be redefined to a new value a' suitable for 
the total interaction. If so, the Peierls constant po appearing in the previous results is 
smaller than the one initially determined. The simplest procedure at this point, if one 
does not want to completely redo the analysis with the new definition of contours, is to 
adopt for po the initial value divided by (2a' + 1)'' [leftmost inequality in (B.41)]. Note 
that the Peierls constant chosen in this way goes to zero with increasing range of the 
perturbations. In fact, in general one can not do much better than this. In particular, 
it is known (cf. Remark 4 in Section 2.6.7) that arbitrarily weak perturbations with 
long-range interactions can destroy the phase diagram. 

To conclude this section, we observe that the notion of contour can be presented in 
a slightly more general (and abstract) fashion. Indeed, the key properties supporting 
the rest of the theory are the unique reconstruction of a configuration from a set of 
contours [here a consequence of the extension property, requirement (CI)], estimates 
(B.44) and (B.48), and that the entropy gain be beaten by the energy cost at low tem- 
peratures. As long as these properties are satisfied, contours need not be defined via 
sampling cubes. An illustration of this observation is the use of "thin" contours in ferro- 
magnetic nearest-neighbor Ising models or, more generally, models whose ground-state 
configurations are constant. The boundary in such a model can be defined as a set of 
polyhedra, and the contours are non-self-intersecting closed (hyper) surfaces (uniquely 
defined via suitable fixed prescriptions to handle intersections), labelled by the config- 
urations of the adjacent spins. The labelling allows for a unique reconstruction of the 
configuration, and the thin contours satisfy estimate (B.44) with \M\ replaced by |r| = 
area of the polyhedra = number of plaquettes forming its faces, and estimate (B.48) 
with a determined on the basis of $. Moreover, they have smaller entropy than the 
"thick" contours. Note, however, that the remark discussed in the previous paragraph 
is especially relevant in connection with thin contours: in general, if the interaction 
is perturbed, one cannot use the value of po determined via thin contours; one must, 
for instance, divide it by a factor (2a -|- 1)'', where a depends on the perturbation $ 
considered. 

B.4.3 Results of the Theory 

We present here the main results of PS theory. We include some general comments 
on the underlying ideas, but we do not discuss the details of the proofs. These can 
be consulted in the bibliography. We mention that there are two approaches to PS 
theory: the original one, based on "contour models with parameters", and the more 
recent one, due to Zahradnik, based instead on a classification of contours into "stable" 
and "unstable" ones. References for the first approach are the seminal papers [296, 
297], Sinai's book [328], and Slawny's review article [329]. The second approach was 



227 



introduced in [366]; a concise presentation is given in [35] and a pedagogical one in [369]. 
This second approach is intuitively more appealing, provides some more information 
— as for instance the completeness [366] and analyticity [368] of the phase diagram — 
and has served as a basis for further extensions and applications of the theory [367, 287, 
288, 194, 35, 36]. In the comments below we mostly have in mind such an approach. 

The essence of Pirogov- Sinai theory — inherited from the Peierls argument — is 
the definition of maps from the original spin ensemble into ensembles of contours that 
interact only by volume-exclusion^ that is, into gases of contours. The families of con- 
tours in the latter do not necessarily correspond to an actual collection of contours of 
a spin configuration, because they are not required to "match" exteriors with interiors. 
For instance, a set of two "—"-contours, one inside the other, is an allowed element 
of one of the contour ensembles, even when there is no spin configuration having it 
as its family of contours (in a spin configuration there would be an intermediate "-|-"- 
contour). This lack of "matching" requirement makes the contour ensembles much 
simpler systems to work with. The maps are defined so that each stable pure phase 
is equivalent to a contour ensemble in the sense that both have the same distribution 
of external contours. The low-temperature picture of only small "islands" of over- 
turned spins can then be precisely proven by estimating the probabilities of (external) 
boundaries using the contour ensembles. 

One considers r different contour ensembles, one for each reference configuration 
ij}^' G /C. The z-th ensemble is formed by all the cu^'^-contours interacting only via the 
restriction of being separated by 2 or more lattice units. The statistical weight of each 
contour is given by an activity exp[— Fi (F)] with a functional Fp (T) determined via a 
relation (formulas (1.14) or (1.19) in [366]) that roughly compares the "work needed to 
install a contour" [369] in the spin and contour ensembles. [In the original PS theory, 
some extra weights exp[M')|Int (F)|] are assigned to the external contours [328], and 
both the "parameters" b^^> and the functional F^^> are also determined by comparing 
"works" (formula (2.43) in [328]). We prefer to follow here Zahradnik's approach in 
which the "parameter degree of freedom" is absorbed into the functional Fp .] 

Fach contour ensemble is a statistical-mechanical system of its own, which can be 
studied without any reference to the original spin system. Properties of these contour 
models can then be transcribed into results for the spin system via the identification 
between the ensembles. This is the usual policy in the standard expositions of the 
theory, all of which include an "interlude" in which abstract contour ensembles are 
analyzed per se (Sections 7 to 9 in Chapter 2 of [328], Section 2 in [366], etc). Basically, 
contour models are studied via cluster-expansion techniques: this is the method of 
choice for systems at "high temperature" or "low density" . All the contour ensembles 
satisfy one of the key ingredients of the Peierls argument: the entropy factor grows at 
most exponentially with the size of the contours [328, Femma 2.7] (this fact is false for 
d = l!). Therefore, there is a marked difference according to whether the functional 
Fp defining the contour activity satisfies a bound of the form 



Ff(V) > T«|M| (B.50) 
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ith Tp > 0. If this is the case, it is customary to say that Fp is a t^ -functional. [For 
the original PS approach, the big difference is whether the corresponding parameter 
b^^' is zero; all the functionals F^^' in the PS approach are r-functionals.] 

The contour models defined by r-functionals enjoy several remarkable properties if 
T is large enough to overcome the entropy growth. This growth is characterized by an 
exponential factor bounded by [328, Lemma 2.7] 



a 



max{jlog(2a + I) , log |0o| + 3^^} . (B.5i; 



If the contour model has a convergent cluster expansion, which occurs at least if [328, 
Lemma 2.8 and Propositions 2.1 and 2.2] 

4'^ > 4a , (B.52) 

then it has a well-defined thermodynamic limit, with a well-defined pressure and 
infinite-volume probability measure. For this measure, infinite contours have zero 
probability of occurrence, more generally, the probability for a given contour to be 
present decreases exponentially with the size of its support. Moreover, the measure 
satisfies exponential mixing conditions for disjoint families of external contours. (See, 
for instance. Sections 7-9 of [328].) Furthermore, each of such contour measures is 
equivalent to a Gibbs measure in the spin system: if Fp is a tI -functional, with 

Tfj ^ 4a, then the (infinite-volume) probability density of external contours of the 
contour ensemble is equal to that of the Gibbs measure — at inverse temperature fi — 
of the spin model defined by the u}^' boundary condition. Thus, this Gibbs measure 
inherits the sparsity of (external) contours characterizing the contour ensemble and its 
mixing properties. It is, therefore, an extremal periodic Gibbs measure (pure phase) 
which is only a small perturbation of the reference (in fact ground-state) configuration 
io}^'. The precise result of this argument is: 

Theorem B.27 (Pirogov-Sinai-Zahradnik) Assume d > 2. If a finite-range pe- 
riodic interaction $ satisfies the generalized Peierls condition (B.44) with respect ot 
a finite set of periodic reference configurations K, = {cu^^V--?^ }; then there exist 
/3o < oo such that for each /3 > /3o 

(a) All the pure phases are Gibbs measures jil defined by the boundary conditions 

io}^' with Fp being a tI -functional. In this case, ri — )• oo as [3 —■ oo. 

(b) Fach pure phase jil is concentrated on configurations with finite boundaries and 
moreover, the probability that a given boundary be present tends to zero as fi ^ oo. 
More precisely, ifV = [M^lom) 

jip {V external contour-} < e~'^^ ' ' . (B.53) 

This theorem was proved by Pirogov and Sinai (see for instance [328, Proposi- 
tions 2.6 and 2.2]), except for the word "all" in Part (a), which was incorporated by 
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Zahradnik [366] ("completeness"). One of the consequences of this completeness is that 
if the interaction $ has a unique periodic ground-state configuration, and it satisfies 
the Peierls condition, then there is also a unique pure phase at low temperature. In 
fact, Martirosyan [260] has proven that, in this situation, in d > 2 there are no other 
extremal Gibbs measures, periodic or not. 

The parameters ri characterizing the pure phases are of the form (see the proof 
of Proposition 2.3 in [328]) 



4'^ > /3p- e^'^ with e('') < 2e-V 3^^ , (B.54) 



where p is the PS-constant of the interaction $. From this expression one can obtain 
some (far from optimal) estimates of the parameters involved. Indeed, for example we 
can choose 

T« = T^ (B.55) 

with Tp satisfying 

Tp = ^p- le-^i^t . (B.56) 

Then, by the requirement (B.52) we obtain the bound 

Tfi> ^p-^ > ^a (B.57) 

2e 



and hence 



4a + l/(2e) 

/3o = '^ ' . B.58 

P 

Note that as /3 — )• oo one also obtains, from (B.56) 

Tp = fjp + Oie-^"). (B.59) 

In principle. Theorem B. 27(a) provides a criterion for the stability of a ground-state 
configuration, but it is quite useless for practical applications. The work of Zahradnik 
[366, 369] provides a different criterion which could be employed for a computer-based 
procedure. It is based on the computation of the pressure p{Fp ) for the cu^'^-contour 
ensemble but including only small (or stable) contours. These are contours whose 
interior volume is at most proportional to the size of the support. At low temperature, 
the coexisting pure phases are those minimizing 

hf ^ ec(c.W)-p(i^«). (B.60) 

It can be proven [366] that p{Fp ) — )• as ri — )• oo (i.e. [3 --■ oo), thus the minimizing 
configurations are ground states. Hence, (B.60) means that the stable ground-state 
configurations are those maximizing the contour-ensemble pressure; that is, those ad- 
mitting the larger number of low-energy small contours. A related stability criterion 
was developed by Slawny [329], employing the pressure of a gas of elementary excita- 
tions instead of the contour-ensemble pressure. The stable phases are therefore deter- 
mined as those with the larger number of low-energy excited configurations [dominant 
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ground-state configurations). This criterion is simpler to apply for paper-and-pencil 
calculations. 

The preceding theorem is the main tool used to prove the stability of the whole 
phase diagram. Let us denote Us{\ ) = {A G R ~ : J2lZi \^i ~ K\ < ^}- 

Theorem B.28 (Pirogov-Sinai-Zahradnik) Consider a finite-range periodic inter- 
action $0 in dimension d > 2 such that: (i) it has r < oo periodic ground-state 
configurations and (ii) it satisfies the original Peierls condition (B.38). Consider a 
perturbation $- = $o + YliZi^i^i) with each $i periodic and of finite range, that 
completely breaks the degeneracy of^Q. Then there exist positive constants /3o, eo such 
that 

(a) For each /3 > /3o there exists a nonempty open set V^ C R ~ such that for 
parameters A G V/j the phase diagram at inverse temperature fi is r-regular. For each 
A G V/j results (a) and (b) of Theorem B.27 hold for the interaction $- taking as 
reference configurations the ground-state configurations o/$o- 

(b) Moreover, there exists an invertible map 

Ip:U._M^Vp (B.61) 

(the "underlying deformation of the parameter space" [369]), which maps each zero- 
temperature coexistence manifold onto the corresponding coexistence manifold at inverse 
temperature fi (more generally, stratum onto stratum). The map Ip converges to the 
identity as /3 — )• oo. In fact, it is an homeomorphism, and even C°°. 

(c) The phase diagram deforms analytically with temperature in the following local 
sense: Consider a point (A , /3') of the phase diagram., with |A | < eo and fi' large enough 
[its minimum value depends on the distance from A to the complement of Ueg (0)7. Let 
K be the set of reference configurations giving rise to the pure phases for the interaction 
$v-' at inverse temperature fi' . Then there exists an analytic function 

13 ^ A(/3) (B.62) 

such that for fi close to fi' and A(/3) close to A , the pure phases for the interactions 
^A(«) ^^ inverse temperature fi correspond to the same set K of reference configurations. 

Pirogov and Sinai proved Parts (a) and (b) of the theorem except for the com- 
pleteness of the phase diagram and the homeomorphic and C°° character of I p. These 
additional results are due to Zahradnik [366, 369], who also proved Part (c) [368]. 

As remarked in [369], maps other than (B.62) need not be analytic. In particular, 
the (/3-dependent) maps Vp — > dQr establishing the regularity of the phase diagram 
at inverse temperature fi are, in general, not analytic. Such a map can be defined, 
for example [366, 369], analogously to the zero-temperature map (B.32)-(B.33) but 
replacing e$_(/ij) by hn [see (B.60)]: 

A ^ {h^\\) - mm /.«(!), . . . , 4'-)(A) - mm hf(\)) . (B.63) 
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[In the original PS formulation, the parameters b^^> played the role of the h^^> here.] 
Such a map is in general not analytic because if it were it would imply that the free 
energy of a pure phase could be analytically continued in A into the metastability 
region; and already for the Ising model it has been shown [204] that such an analytic 
metastable extension does not exist. On the other hand, the map (B.63) is of limited 
physical significance, because for lj}^> not defining a pure phase, the corresponding 
quantity h^^>{\) is only an auxiliary concept, not even uniquely defined [369]. The 
physically interesting objects are the strata 

Sxif) = {J:Qfj{J) = K} (B.64) 

for each A^ C /C, where 

g,(A) = {z : hf{J) = mjn hf{J)} . (B.65) 

These strata deform (locally) analytically with the temperature, by Part (c) of Theorem 
B.28. 

Non-optimal estimations of the limit values /3o and Cq of Theorem B.28 can be 
obtained combining Corollary B.26 with (B.58): If $o satisfies the Peierls condition 
with constant /Jq, then at most 

A ^ '-^i^ (B.66, 

po{l - 2c) 



^0 = 777-^^^ • (B.67) 



if at least 

cpo 

(2a + 1) 

These bounds are an explicit example of a general fact about the proof of Theorem 
B.28. As all the results follow from studying the equivalent contour ensembles, the 
relevant magnitudes are those actually used to construct these ensembles: the Peierls 
constant po and the exponential entropy factor a. In addition, the dimension d and 
the size a of the sampling cubes appear via the uniformity property. As a consequence, 
we have: 

Corollary B.29 Consider a family of original interactions {^o{p)}-pp^^m satisfying 
the Peierls condition uniformly that is, with the same constant po and the same family of 
periodic ground-state configurations K, for allp^P (e.g. in the conditions of Corollary 
B.25). Then Theorem B.28 holds also uniformly for all the interactions $o(p) [i-e-, one 
can chose the same jSq and Cq in Parts (a) and (h), and the same fi- and A- intervals 
in Part (c)]. 

More generally. Theorem B.28 can be extended to situations in which there is a 
further smooth dependence of the interactions on the extra parameters p. 

Theorem B. 30 As.sume d > 2 and consider interactions $o(p), $i(p), • • • , $r-i(p) 
depending analytically on parameters p taking values on an open set P C R™, and with 
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bounded period and range. Assume that there exists apg^P such that (i) $0(^0) ^^•^ 
r < 00 periodic ground-state configurations and (ii) $o(Po) satisfies the original Peierls 
condition (B.38). Assume also that for each p ^ P the perturbation ^-{p) = $o(p) + 
J^lZl \i^i{p) completely breaks the degeneracy o/$o(p)- Denote [3 = {jS,p). Then there 
exist positive constants /3o, £0 o-nd ei such that the results (a), (b) and (c) of Theorem 
B.28 hold replacing fi by fi and the condition "fi > /3o" by "fi > fio,\p — Pq\ < e\ " . 

There are no simple explicit formulas for the values /3o, £0 and ei. 

As mentioned at the end of the preceding Section B.4.2, the theory can be adapted 
to slightly more general notions of contours. In particular it applies for the "thin" 
(polyhedral-like) contours of models with constant ground-state configurations (e.g. 
the ferromagnetic Ising model, Blume-Capel models, etc). Such contours provide the 
best (largest) estimate of po (e.g. for Ising models po = 2J), and have and entropy 
growth with an exponential factor bounded by (see, for instance, [169]) 



a, 



\og{2d-l). (B.68) 



This bound is smaller than (B.51), and therefore yields another source of improvement 
on the estimates of /3o in (B.58) or (B.66). Moreover, in the bound (B.67) for Eq, we 
can use 2a = range of the perturbation X] \^i] which is the minimal possible choice. 

B.4.4 Extensions of the Theory. The Random Case 

Pirogov-Sinai theory has been extended in several directions. For example, we mention 
the extensions to systems with long-range [287, 288], quasiperiodic [221] and complex 
[21, 295, 368, 35] interactions; systems with continuous spins [97, 367]; systems on a 
continuous space [45, 46]; field-theoretical systems [202, 203, 35]; and systems with 
infinitely many periodic ground-state configurations [46, 47, 80, 79, 77, 179, 78, 48]. 
Among these are models with energy cost growing slower than the area of the boundary, 
such as the balanced model [132, 48] and the ANNNI model [48, and references therein]. 
Also worth mentioning are the applications to the study of interfaces [367, 369, 194], 
random surfaces [261] and finite-size scaling [36]. 

These extensions generalize the theory chiefiy in two directions. First, ensembles 
of interacting contours are introduced [202, 203, 21, 46, 79, 77, 287, 288, 48]. The 
interaction among contours (on top of volume exclusion) must be weak, to allow the 
convergence of the cluster expansions. Second, the set of reference configurations is 
replaced by a set of reference measures. These can be supported on whole classes 
of ground-state configurations [179] or, more generally, on families of configurations 
— restricted ensembles — suitably chosen so as to include entropy contributions. In 
many cases, these restricted ensembles are formed by low-energy excitations of ground 
states [80, 79, 77, 78, 48, 261], but other definitions are in principle possible. For 
instance, the ensemble for the disordered phase of the Xaxge-q Potts model — and 
for other examples pertaining to the study of liquid-gas phase transitions [46] — is 
supported on "maximally disordered" configurations [46]. The latter corresponds to 
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a "pure-entropy" restricted ensemble [46], as opposed to the "pure-energy" (just one 
ground state configuration) or "almost-pure-energy" (ground state plus excitations) 
used in most of the applications. In general, the restricted ensembles are chosen so to 
have minimal (restricted) free energy at the temperature of interest. Often, interacting 
contours and reference measures are alternative procedures; it is a matter of taste to 
choose one or the other. 

A third direction in which PS theory has been extended — and one which is crucial 
for our application to the proof of RG pathologies in nonzero magnetic field (Section 
4.3.6) — is towards the incorporation of random interactions. In work unpublished so 
far, Zahradnik [370, 371] — generalizing the work of Bricmont and Kupiainen [43, 44] — 
has proven that for d > 3 the addition of a small enough random interaction only 
produces small deformations in the phase diagram. An important issue is the meaning 
of "small enough". In the original work [370], the random interaction was required to 
be uniformly small respect to the nonrandom part. Later we were informed [371] that 
the proof also applies to random contributions that are small in probability. We state 
the later version, which is the one suited to our applications. 

Theorem B.31 Consider the lattice Z , d > 3, and an interaction $- = $o + 
J^lZl Aj$i satisfying the hypothesis of Theorem B.28. Add a finite-range random in- 
teraction (I)'<*°™ = \ (i)^°™( • , k) ^ , where k is a random variable with probability dis- 
tribution P , such that the random variables $^°™( • , n) and $^°™( • , n) have the same 
distribution if A' is a translate of A and are independent if A Cl A' = 0. Assum.e, 
in addition, the following smallness condition: For each (5 > there exists e{8) small 
enough such that 

P(|$-°-(cu,«)|>^) < e (B.69) 

for all A ^ S, id ^ ft. Then, for fi large enough the phase diagrams for $- and 
$--|- (I)'<*°™ are homeomorphic. More precisely, for fi large and e uniformly .small, there 
exists an hom.eom.orphi.sm. 

between two open sets Vp^,Vp^ C R*^" mapping an r-regular $- phase diagram onto 
an r-regular $"+ $"*°™ phase diagram.. The hom.eom.orphi.sm. Lp^^ tends to the identity 
as e — )• uniformly. 

B.5 Application to the Examples of Section 4 

B.5.1 General Strategies 

In principle, the verification of the Peierls condition for a certain interaction $o is a 
two-stage process: 

(A) Find all the periodic ground-state configurations of $o- This stage usually 
involves counting how many "frustrated bonds" each candidate configuration has. This 
should be followed by a proof showing that indeed no other periodic configuration has 
the same or less energy density, but this proof is usually omitted because it is either 
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considered to be obvious or too messy to write down. Furthermore, as remarked before 
Definition B.20, this proof is not really necessary if the Peierls condition (next stage) 
can be successfully verified. In this regard, the tedious process of finding these reference 
configurations is a natural candidate for a computer-assisted procedure. However, this 
may not be possible, in general: the problem of checking whether a given periodic 
configuration is a ground state may not be algorithmically decidable (see [313, Section 
4.15] and [316] and references therein for some related undecidability results, and see 
also [263]). In any case, this problem may be alleviated in practice if one works in the 
framework of PS theory. Indeed, the further steps of the theory work as a correcting 
mechanism: if too few configurations have been found, the Peierls condition will fail 
and the the configurations of large contours with very low energy density will give a 
hint of how additional ground-state configurations look like. On the other hand, if too 
many configurations have been selected, the spurious ones will be eventually ruled out 
in the sense that they will not give rise to pure phases; they lead to contour ensembles 
with high free-energy cost, which are not associated to r-functionals. [We owe this 
insight to conversations with Milos Zahradnik.] 

(B) Devise a suitable notion of contour and show that the energy grows propor- 
tionally to its support. This is an extremely model-dependent process. 

Often, the determination of ground-state configurations and of contour energies 
are done simultaneously: The comparison between the energies of different candidate 
configurations is done already with the help of contours. This is a manifestation of what 
we have repeatedly commented upon: the contour energy is the essential quantity; the 
crucial stage is (B). If we manage to show that contour energies are large enough then 
we do not need to care about the nature of the reference configurations, they are ground 
states by force. Yet again, the same argument used to prove the appropriate growth 
of contour energies, usually shows the ground-state character of the configurations. 
Proving (B) directly is not a substantial saving of misery. 

Below, we shall use this canonical approach for decimation and Kadanoff-p pre- 
scriptions, where we can resort to the Ising type of contours: polyhedra drawn on the 
dual lattice with plaquettes perpendicular to each frustrated bond. For the studies on 
other RG transformations we shall use instead the Holsztynski-Slawny criterion (The- 
orem B.21) based on the notion of ?7i-potentials (Definition B.20). The corresponding 
procedure usually starts by rewriting the interaction into an equivalent form which is 
indeed an ?7i-potential. This is the hard part of the game; it involves some knowledge 
of what the ground-state configurations look like. Once the rewriting is done, the ac- 
tual verification of which configurations have minimal energy is in general simple; one 
studies one bond at a time. As already remarked, the disadvantage of this approach 
is that it does not supply a value for the Peierls constant, a fact that, in our case, 
prevents us from producing any estimate of the range of temperatures for which the 
pathologies of the RG transformations occur. 

We observe that for the Steps 1-3 in the proofs of existence of pathologies, we do not 
need the full information on phase diagrams provided by PS theory. Rather, we are only 
interested in the point of maximum coexistence (for Step 1), and regions of uniqueness 
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of Gibbs measure (for Step 2). Moreover, both types of questions refer to different 
systems: The maximum coexistence point must be determined for internal-spin systems 
obtained when the image spins do not favor any phase of the original system (this 
amounts, in general, to block spins chosen in an alternating or random way). In these 
cases, the symmetry-breaking perturbation is chosen simply as a uniform magnetic 
field and symmetry considerations imply that maximum coexistence is achieved only 
when this field is zero. Therefore, this symmetry-breaking interaction plays an almost 
invisible role, and it is only briefiy mentioned. The only case in which the symmetry- 
breaking deserves careful consideration is that of systems which already initially have 
a magnetic field (Sections 4.3.6 and B.5.7). On the other hand, the uniqueness of 
Gibbs measures is of interest for internal-spin systems corresponding to block spins 
chosen so as to definitely favor one of the phases. It is not wise to think of these two 
internal-spin systems (determined by block spins either favoring or not favoring one 
pure phase) as living in the same phase diagram with block-spin hipping as symmetry- 
breaking interaction. The problem is that such an interaction can not be considered a 
perturbation: it goes by finite steps and hence may throw us out of the small-parameter 
region Vp (see Theorem B.28) of the phase diagram where PS theory holds. 

Let us now discuss the different applications starting from the simplest ones. 

B.5.2 Internal-Spin Systems with Unique Ground-State Configurations 

In all the applications of Section 4 there is a step (Step 2.2) which involves showing 
that at low temperature the ensemble of internal (or original) spins has a unique Gibbs 
measure for some particular choice of image spins. These are all cases in which there is 
only one (periodic) ground-state configuration, due to the presence of a periodic single- 
sign magnetic field. The uniqueness of the Gibbs state is, basically, a consequence of PS 
theory plus Zahradnik's completeness result (see the comment immediately following 
Theorem B.27). However, this would only prove uniqueness among the periodic Gibbs 
measures; we need Martirosyan's extension of this result [260] to prove uniqueness 
among the set of all Gibbs measures. 

B.5.3 Internal Spins under Decimation 

In this case we can apply the canonical two-stage process described above to verify 
the Peierls condition. As discussed in detail in Sections 4.1.2 and 4.2 (Step 0), the 
ensemble of internal spins for a given image-spin configuration is just the ensemble of 
configurations of original spins with some of the spins constrained to be fixed. To the 
latter we can apply the usual Peierls construction of polyhedral contours. We shall 
therefore use the "thin" notion of contours discussed at the end of the two previous 
Sections B.4.2 and B.4.3. The internal-spin contours are obtained from the original- 
spin contours by removing the plaquettes adjacent to an image spin [thick lines in 
Figure 14(a)]. The argument that follows is thus based on comparing internal-spin 
with original-spin (= internal -|- image) quantities. 
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We fix the image spin in tlie fully alternating "+/— " configuration, so the two obvi- 
ous candidates to ground-state configurations in the resulting system of internal spins 
are the same as for the original system: lj}~^> equal to -|-i everywhere, and lj}~> equal 
to — i everywhere. That is, in terms of original spins, lj}~^> (resp. id}~') corresponds to 
all spins "-|-" ("— ") except for a sublattice of period 26 — with b being the decimation 
spacing — where the spins are fiipped. The symmetry-breaking perturbation Ai$i can 
be taken to be a magnetic field at each (internal) site. However, the symmetry of the 
problem (i.e. of the choice of block spins), implies that the coexistence of the "-|-" and 
"— " measures occurs at zero values of this field. We therefore forget about this extra 
field, and concentrate on proving that the zero-temperature phase diagram deforms 
little for low temperatures. 

If we wish only to prove that lj}~^> and lj}~> are all the internal-spin ground-state 
configurations, we can for instance consider the corresponding set of original-spin con- 
tours, which is just an array of 26-spaced unit cubes surrounding each fiipped spin, 
and show that all other original-spin configurations lead to a system of (original-spin) 
contours with a larger area. This is not hard to do, for instance we can argue as follows: 
Observe that every interval parallel to the axis between two nearest-neighbor image 
spins necessarily contains at least one broken bond, as two nearest-neighbor image 
spins always have opposite signs. The choice of all internal spins either all -|- or all — 
has precisely this minimal choice of exactly one broken bond in each interval, and no 
other broken bonds. 

However, such an argument is not really needed. As commented above, a more con- 
venient approach for our purposes is to show directly that the insertion of a contour 
in Lj}~^> has an energy cost proportional to its area. Consider then the (internal-spin) 
configuration uj^ obtained by inserting a "-|-" (internal-spin) contour F inside the con- 
figuration id}~^\ that is, a region of "— " bounded by F [Figure 14(a)]. Its relative energy 
can be written in the form: 

H{uj^\J+^) = 2J\T\-AE_ + AE+, (B.71) 

where |F| is the area of the contour [number of plaquettes, or length of the thick lines 
for the two-dimensional example of Figure 14(a)], AE^ is the energy gain due to the 
fact that the "— " image spins inside of, or visited by, F acquire "— " neighbors [e.g. the 
sites ai in Figure 14(a)], and AE^ is the additional energy due to "-|-" image spins in 
F. 

To prove the Peierls condition we have to check that the extra contribution AE^ — 
AEj^ is proportional to the area of the contour with a not too large proportionality 
constant. The intuition is clear: The contributions corresponding to "-|-" and "— " 
image spins inside the volume cancel each other, except possibly for a layer of image 
spins placed close to the contour. This correction is hence of the order of the area of the 
contour, with proportionality constant given roughly by the inverse of the separation 
between these image spins. However, this last bound is not applicable if the contour 
involves few (e.g. one) image spins. In this sense, it is natural to distinguish between 
contours surrounding and contours avoiding the image spins. While the former "feel" 
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(b) 



Figure 14: System of internal spins (small circles) under decimation, when the image 
spins (squares) are fixed in the fully alternating "+/— " configuration, (a) A "— " 
contour F (thick lines), (b) The fundamental bonds (bounded by thick lines) of an 
equivalent ?7i-potential. 
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the sublattice of image spins, the energy contribution of the latter is almost the same 
as for the usual Ising model. This produces an estimation of the Peierls constant (and 
hence of the critical temperature), with two competing terms: one depending on the 
decimation period b (and tending to as 6 tends to infinity), and another independent 
of b and close to the Peierls constant for the Ising model (the closer the higher the 
dimension). 

To formalize these ideas we choose one coordinate axis, say the one labelled 1, and 
perform the cancellations by sweeping in order along it. To abbreviate, let us call "left" 
the direction in Z towards smaller 1-components, and "right" the opposite direction. 
Also, we shall say that an image spin is "in" the contour if it is visited by it or it is 
contained in its volume. More specifically, an "internal spin with / plaquettes in the 
contour" is an internal spin such that the contour surrounds / of the plaquettes of the 
unit cube centered on it (e.g. in Fig. 14(a), the internal spin at ai has 1 plaquette in the 
contour, the one at 04 has 3, etc.). Note that, in such a case, the energy contribution 
of the image spin is 

AE_ = 2\J\l (B.72a) 

AE+ = (B.72b) 

if it is a "— " spin, and the converse for a "+" image spin. The cancellation can be 
done, for instance, as follows: For each line in the left-right direction intersecting the 
contour, we choose the image spin in the contour further to the left and look for the 
next image spin in the contour, of opposite sign and located to the right and along the 
same line. If the "+" image spin has the same number of plaquettes or more in the 
contour than the "— ", we cancel both contributions (obtaining a lower bound for the 
energy if the number of plaquettes is not the same). Otherwise we do nothing. We 
then proceed to the next uncancelled image spin along the same line, always travelling 
towards the right. Once all the left-right lines have been scanned, we obtain a lower 
bound for the energy of the form (B.71) but where AE^ and AE^ refer only to a layer 
of image spins in the contour at a distance not exceeding b -\- 1 from it (remaining 
spins). The energy gain due to these remaining spins can not exceed that of the case in 
which there are no "-|-" image spins left and all the "— " spins have their 2d plaquettes 
in the contour. We therefore bound 

AE_ - AE+ < 2dJN_ , (B.73) 

where N^ is the number of "— " image spins inside the above-mentioned layer. 

To complete the Peierls bound, we have to relate N^ to the contour area |r|. At 
this point we must distinguish between contours with N^ > 2 ("wide contours") and 
contours with N^ < 1 ("narrow contours"). For the wide contours, the key observation 
is that each two "— " image spins must be at least a distance 26 apart in each coordinate 
direction and the contour must pass at a distance b-\-l or less of each of them. Therefore, 
the number of remaining "— " image spins for a given value of F can not exceed that 
of the case in which all of the spins are located so as to form a "tube", separated 26 
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units from each other, and F being the wall of such a "tube": 

-''- 2 2(cl -!){/■- 1) ■ <^-'^> 

(Note that the bound contains 6—1, rather than 6, because the plaquettes corresponding 
to "+" image spins are not part of the contour.) From (B.71)-(B.74), we conclude that 



H{uj^\J+^) > 2J 



d 
1 



2{d-l){b-l) 



|F| ; iV_ > 2 . (B.75) 



This bound is not useful for the limit case J = 2, 6 = 2; but for it we have already 
good bounds for the critical temperature (Section 4.1.2). 

Fet us now consider the narrow contours (A^_ < 1). It is not hard to convince oneself 
that the worst case is when the contour visits only one "— " image spin, which has / 
plaquettes in the contour. The contour must then include the opposite / plaquettes 
plus the plaquettes needed to join these among themselves or/and to the cube so to 
form a closed surface. One can check that the worst situation (smallest ratio //|F|) 
is when the / plaquettes of the cube are not consecutive, in which case the contour 
includes the / opposite plaquettes and the 2[d — 1)1 plaquettes needed to join them to 
the cube. Hence, 

|F| > l[2{d-l) + l] . 

Therefore, using (B.72), 

2J_ 

21^1 

and 



AE_ -AE+ < — riFl 



H(Lo^\J+y) > 2J 



1 
1 



|F| ; iV_ < 1 . (B.76) 



2d- 1. 

(Another way to interpret this Peierls bound is by noting that the narrow contour with 
the least energy cost is the one produced by a single flipped internal spin adjacent to 
a "— " image spin.) 

Formulas (B.75) and (B.76) show both that the configurations lj}~^> and lj}~> are 
indeed the only periodic ground-state configurations and that the Peierls condition is 
satisfied for the internal-spin system with Peierls constant 

po > 2J(1 - Md,b) , (B.77) 

where 

( 1 d ] , , 

Mdb = max<^— , — — -} . (B.78) 

\2J-1 ' 2{d - l){b - 1) j ^ ' 

This value of /Jq, together with the estimates (B.58) for /3o and (B.68) for the entropy 
factor a of thin contours, implies there is a phase transition at least for 
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This estimate is probably very weak, but at least it increases with d and with b as it 
should. In fact, if b is large, the alternating fields are very far apart, and the system 
becomes almost indistinguishable from a zero-field Ising model. Therefore we expect, 
but can not prove, that this limit temperature approaches the Ising-model critical 
temperature when b tends to infinity. 

Inequality (B.79) determines the range of temperatures for which we can prove that 
the decimation transformation has pathologies (Sections 4.2 and 4.3.2). 

As a warm-up for the following sections, let us sketch how the Peierls condition can 
be verified in the present example using the Holsztynski-Slawny criterion (Theorem 
B.21). The argument depends slightly on the decimation spacing b. For 6 = 2 the 
internal-spin interaction is already an ?7i-potential because it is just a ferromagnetic 
two-body interaction in a "diluted" lattice (Sections 4.2 and 4.3.1). For 6 > 3 a 
periodic ?7i-potential is obtained by grouping all the bonds inside cubes containing at 
least one period of the image-spin configuration [Figure 14(b)]. Fxplicitly, if (I)'"'""^'' 
is the interaction for the internal-spin system, the equivalent 26-periodic ?7i-potential 
(j)m-pot j-^g^g (jjm-pot _ Q ^j-^iggg ^ is a periodic translate, with period 26, of the cube 



V 



— 


6-1 

2 


1 


36-1 " 

2 



;b.8o) 



(here [J denotes integer part), or of the inter-cube bonds formed by nearest-neighbor 
pairs (x^y) with, say, x in the (internal) boundary of V. For these pairs ^T~Y* — 
^{xy} = ~J ^ and for the cube V 



^TTl — pot 



E 

ACV 



^internal 

^A 



[BM] 



It is not hard to verify that the configurations io}~^' and to}' are the only minimizers 
of the $y~''°', and they obviously also minimize the energy of the inter-cube bonds. 
Therefore, (j)™-?"' is an ?7i-potential with a finite number of ground-state configurations. 
By Theorem B.21, the Peierls condition is satisfied. No estimation of /3o follows from 
this approach. 



B.5.4 Internal Spins under the KadanofF Transformation. Uniformity 

For this case we have to apply the uniformity results that we so carefully stated above. 
The Hamiltonian (4.33) can be decomposed in the form 



HeS = Ho + Hp 



:b.821 



where Hq is the usual Ising Hamiltonian and Hp corresponds to the interaction $(p, /3) 
defined by 



:^AiP,m<^) 



' -ip/l^)K ^« if A = {i} and i G B^ 

(l//3)log2cosh(^pE.eB.^.) iiA = B, (B.83) 

otherwise . 
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(We recall that in this appendix we are "unabsorbing" /3 which in (4.33) is absorbed 
only in J.) We choose cu^ j^j as some alternating configuration — with as many pluses 
as minuses — so that, by symmetry, the coexistence of the "+" and "— " internal-spin 
Gibbs measures does not require any additional field. That is, as before we forget 
about symmetry-breaking interactions. 

The interaction $o satisfies the original Peierls condition with ^q = 2J (thin con- 
tours, /C = {uj^~^\uj^~>}). We can then resort to Corollary B.25 to conclude that the 
whole interaction $o + ^(p,/?) satisfies the original Peierls condition. However, to es- 
timate the Peierls constant we must correct ^q so as to satisfy (CI) and (C2) for the 
total interaction. For instance (see remarks at the end of Section B.4.2), we can replace 

it by 

2/3J 

^° " (2a + 1)^^ ' 

with a equal to half the length of the largest side of the block. We then conclude that 
for each p there exists a value /3i(p) defined by 



f + -log(2coshp|i?|) = 777——^, (B.84) 



such that for /3 > /3i(p) the effective internal-spin interaction for the p-Kadanoff trans- 
formation satisfies the Peierls condition with constant 

In the last two formulas, the constant c is arbitrary as long as < c < 1/2. We shall 
find an optimal choice below. 

At this point we can apply, for instance. Corollary B.29 to obtain that for each 
finite p there exists a value /3o(p) such that for fi > fio{p) the system of internal spins 
corresponding to a p-Kadanoff transformation has a nontrivial phase diagram with a 
first-order phase transition between a "-\-" and a "—" Gibbs measure. The formulas 
(B.66) and (B.68) imply the estimate 

/3o > max|A(p) , [41og(2J- I) + {2e)-']-^^±^^ . (B.86) 

From the point of view of this bound, the optimal choice of c is when fi\{p) equals the 
competing term in the RHS of (B.86). This produces the rather ugly-looking bound 

2J Md[iL,^\B\ + Md- ^MdiMd + 8Z,,|B|) ' 

with 

Md = 41og(2J-I) + (2e)-i 
Lp,\B\ = p + log2coshp|i?| . 
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Formula (B.87) gives a lower bound for the temperature up to which a KadanofF 
^-transformation exhibits pathologies (Section 4.3.3). This bound goes to zero if p 
tends to infinity, hence it is useless for majority-rule transformations. 

B.5.5 Internal Spins under Majority Rule 

For this case, we use the Holsztynski-Slawny criterion (Theorem B.21). The system 
of internal spins subjected to the constraint of a doubly-alternating 7x7 block-spin 
configuration can be written as a periodic ?7i-potential with period 28. The fundamental 
bonds of this potential are the squares of size 28 X 28 depicted in Figure 8(a), and all 
the bonds connecting neighboring squares. It is straightforward to check that the 
configurations of Figure 8(b) are the only ones satisfying all the bonds of this m- 
potential. Hence, the system has a finite number (two) of ground states which, by 
Theorem B.21 and PS theory, give rise to different and coexisting Gibbs measures at 
low-enough temperature. By symmetry considerations this coexistence takes place at 
zero values of the symmetry-breaking field. An analogous argument can be used for 
all the other block-sizes bk given in (4.35). 

B.5.6 Internal Spins under Block-Averaging Transformations 

Again, we resort to the Holsztynski-Slawny criterion (Theorem B.21). The system of 
internal spins subjected to the constraint of zero average spin in every 2x2 block can 
again be written as an m-potential which is periodic (with period 2). The fundamental 
bonds are the 2x2 squares and the bonds connecting them, and the only four periodic 
ground states satisfying every bond are easily seen to be the ones depicted in Figure 9. 
Thus at sufficiently low temperature PS-theory provides the phase transition needed in 
Step 1. Again symmetry allows us to dispose of any symmetry-breaking field to follow 
the coexistence point. 

B.5.7 Internal Spins when h y^ 0. Random Field 

The result needed to prove the presence of pathologies for non-zero field in the deci- 
mation and Kadanoff examples of Section 4.3, is a direct consequence of Zahradnik's 
Theorem B.31. We apply this theorem with $o equal to the interaction for the system 
of internal spins with fully alternating image spins, and the symmetry-breaking pertur- 
bation $1 taken to be a uniform magnetic field. The random interaction is the random 
magnetic field induced by those block spins that were fiipped from "-|-" to "— " with 
probability e/(2J), as discussed in Section 4.3.6. By Theorem B.31, the resulting low- 
temperature and low-e phase diagram is only a small perturbation of the phase diagram 
for the non-random part, which is itself a small perturbation of its zero-temperature 
phase diagram. In particular, the ground-state energy for almost all realizations of the 
Hamiltonian is an almost linear function of the parameters h and e. For e sufficiently 
small, the linearity is only weakly violated, and the compensating uniform field (that 
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is, the value of h as a function of e needed to keep the system on a coexistence surface) 
is an almost linear — hence strictly increasing — function. 

C Solution of the Diophantine Equation (4.34) 

Consider the pair of Diophantine equations P = 2c^ it 1 (6, c integers > 1). The 
following intuition was suggested to us by Vincent Rivasseau: If (6, c) satisfies either 
of these equations, then b/c must be an excellent rational approximation to v2, in the 
sense that 

17)2/^2 91 1 

b/c-V2 = '\/" ~^' < -^ . (C.l) 

' b/c+ V2 ~ V2c^ ^ ^ 

(Note that, by contrast, for "typical" integer denominators c, one has inf \b/c — ^/2\ ~ 

1/c ^ 1/c^-) Now, the best rational approximations to v2 can be obtained from the 
continued fraction [234, 317] 

72-1 = — . (C.2) 

^+ — 

This suggests to consider the recursion 

which converges to ^/2 — 1 as n —■ oo (for any Xq > —2); equivalently, defining 
Vn = Xn -\- l^ we find the recursion 

Vn+l = — , C.4 

1 + yn 

which converges to v2 (for any j/o > ~1)- In particular, setting j/„ = bn/cn with 6„,c„ 
positive integers, we find the linear recursion 

bn+i = bn + 2cn (C.5a) 

Cn+l = bn + Cn (C.5b) 

Now this recursion has the remarkable property that 

&^+i-24+i = -ibl-2cl); (C.6) 

in particular, if b^ = 2c^ it 1, then b^_^_^ = 2c^_^_-^ =p 1. Therefore, if we start from 
(6o,Co) = (1,1), we generate pairs (6„,c„) which satisfy b^ = 2c^ — l (resp. b^ = 2c^ + l) 
for even (resp. odd) values of n. The explicit formula is 

bn = ^ [(1 + ^)'^+^ + (1 - ^)'^+^] (C.7a) 

cn = ^[ii + V2'r'-ii-V2'r'] (c.7b) 
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To prove that this sequence constitutes the complete set of integer solutions to P = 
2c^ lb 1, we run the iteration (C.5) backwards: given (fe, c), we define 

b' = -b + 2c (C.8a) 

c' = b-c (C.8b) 



and show that repeated apphcation of this map must eventually lead to the pair (1,1). 

Lemma C.l Let 6, c > 1 be integers satisfying P = 2(? it 1. Then 6',c' are integers 
satisfying 0<b'<b,0<c'<c and b' = 2c' =p 1. 



Proof, (a) c > 1 imphes 6^ = 2c^ ± 1 > 2c^ - 1 > 2c^ - c"^ = c"^. Hence b > c and 
c' > 0. Also b-b' = 2{b-c)> 0, so b' < b. 

(b) c > 1 imphes P = 2c^ ±1 < 2c^ + I < 2c^ + c^ = 3c^ < ic^. Hence b < 2c and 
b' > 0. Also c - c' = 2c - 6 > 0, so c' < c. 

(c) b'^ - 2c'' = (2c - by - 2(6 - c)2 = -{P - 2P) = ^1. ■ 

Since c strictly decreases at each iteration of (C.8), we must eventually reach c = 1, 
hence 6=1. Since (C.8) is the inverse of (C.5), the original pair (6, c) must be (6„, c„) 
for some n. We have therefore proven: 

Theorem C.2 A pair of integers 6, c > 1 satisfies the Diophantine equation P = 
2(? — 1 (resp. P = 2(? + \) if and only if (6, c) = (6„,c„) for some even (resp. odd) 
integer n > 0. 

In particular, the block sizes b for which the majority-rule construction in Section 4.3.4 
works are 62,64,66,... = 7,41,239,1393,8119,... . 

Remarks. 1. After completing this proof, we learned that it was previously 
published by Theon of Smyrna [344] circa 130 A.D., and probably goes back to the 
Pythagorean school [301]; the identity (C.6) is proven geometrically in Euclid's Ele- 
ments (Book n. Proposition 10). The special case 6 = 7, c = 5 is mentioned in Plato's 
Republic (546 C), though without the renormalization-group application. For a history, 
see Heath [191], Dickson [76, Chapter XH], Tannery [343] and Mugler [270]. 

2. One might wonder about the rational approximants to v^ obtained by using 
Newton's method J/ 1— ^ |(j/ + 2/j/). Setting y = b/c, we obtain the nonlinear recursion 
(6, c) I—)- (6^-|-2c^, 26c) = (6, c). It is straightforward to prove by induction that if (6, c) = 
(6„,c„), then (6, c) = (62^+1, C2„+i). So Newton's method generates a subsequence of 
the continued-fraction sequence. 

It is natural to ask whether our construction in Section 4.3.4 can be extended to 
Ising models in dimension d > 3. Clearly this works if and only if the block size 6 and 
island size c satisfy the Diophantine equation 

1 + 6^^ = 2c^ (C.9) 
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Unfortunately, we suspect that for d > 3 there are no positive-integer solutions to 
(C.9) other than b = c = 1. The best results we have been able to glean from the 
mathematical literature are summarized in Theorems C.3 and C.4: 

Theorem C.3 Let I be a positive integer satisfying any one of the following three 
conditions: 

(a) I = 3; or 

(b) I = 4; or 

(c) I is a regular prime^^ such that the exponent of 2 mod I is"^^ either [l — l)/2 or 
even, and such that 2'"^ ^ 1 (mod P). 

Let d be any multiple of I (including I itself). Then the only positive-integer solutions 
to x"^ -\- y"^ = 2z'^ are x = y = z. In particular, the only positive-integer solution to 
l + b^ = 2c^ isb = c = 1. 

Theorem C.3 has a long history. Obviously, if the theorem holds for any given 
power /, it trivially holds also for multiples of that power. The case / = 3 was proven 
by Euler sometime before 1770 [76, p. 572]; much more general results are now known 
[265, pp. 126, 203, 220]. The case / = 4 is a specialization of a theorem proven by 
Schopis in 1825 [76, p. 618] [265, p. 18]; again, more general results are now known 
[265, pp. 271, 274, 276]. The case 1 = 5 was proven in the mid-nineteenth century, but 
the proper attribution is unclear. Denes [74] credits Dirichlet [81], but our reading of 
Dirichlet's paper indicates that he treated numerous cases of x^ -\- y^ = Az^ but not 
A = 2 (see also [76, p. 735]). The correct attribution seems to be V.A. Lebesgue in 
1843 [235] [76, p. 738]. See also [76, pp. 755-756] and [265, p. 276] for generalizations. 

The case (c) was proven by Denes [74] in 1952. To interpret it, note that the 
first irregular primes are 37,59,67,101,... . The first regular primes for which the 
exponent condition fails are 31,73,89,127,... . Finally, the only primes / < 6 X 10^ 
(and indeed the only ones currently known) for which 2'"^ = 1 (mod P) are 1093 
and 3511 [308, pp. 263-264]. Thus, the first primes for which condition (c) fails are 
31, 37, 59, 67, 73, 89, 101, .... In particular. Theorem C.3 holds for all exponents d < 
100 except possibly 31, 37, 59, 62, 67, 73, 74, 89.®° 

Theorem C.4 For arbitrary d > 3, there is at most one positive-integer solution to 
l^b'^ = 2c'^ other than b = c=l. 



^®A prime / > 3 is called regular if it does not divide any of the numerators of the Bernoulli numbers 
B2, B4, . . . , Bi-s expressed in lowest terms. 

^^The exponent of 2 mod / is the smallest integer n > 1 such that 2" = 1 (mod /). 

^'^Denes' paper [74] contains a list of all primes / < 619 for which condition (c) fails. This list has, 
however, a few mistakes: The primes 389 and 613 should be added to the list of irregular primes [247]; 
the exponent of 2 mod 281 (resp. mod 563) is 70 (resp. 562); and the final list in his article should 
read 31, 73, 89, 127, 151, 223, 337, 431, 439, 601. 
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Theorem C.4 is a special case of a result of Domar [98], who proves that for arbitrary 
integers A,B >1 and d > 5, the equation \Ax — By \ = 1 has at most two solutions 
in positive integers x^y. See also [111]. 

The conjectured unsolvability of x"^ -\- y"^ = 2z'^ for J > 3 is a special case of an 
"extended Fermat's last theorem" which might conceivably be true [142, 62]: 

Conjecture C.5 Let d and a he integers, with d > 3 and 1 < a < d. Then there are 
no solutions of x"^ -\- y"^ = az"^ in nonzero integers, except for x = y = z in the case 
a = 2. 

We find it amusing that a real problem in physics should be connected with Fermat's 
last theorem, but we think that this is an artifact of our method of proof and not an 
intrinsic fact about majority-rule transformations. Indeed, we suspect that Theorem 
4.5 holds for all block sizes b > 2 and all dimensions (/ > 3, without regard for subtle 
number-theoretic properties. But we could be wrong. 
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